The present invention includes recombinant proteins derived from Clostridium botulinum toxins. In particular, soluble recombinant 
MULTIVALENT VACCINE FOR CLOSTRIDIUM BOTULINUM NEUROTOXIN 

FIELD OF THE INVENTION 

The present invention relates to the isolation of polypeptides derived from Clostridium 
botulinum neurotoxins and the use thereof as immunogens for the production of vaccines, 
including multivalent vaccines, and antitoxins. 

BACKGROUND OF THE INVENTION 

The genus Clostridium is comprised of gram-positive, anaerobic, spore-forming bacilli. 
The natural habitat of these organisms is the environment and the intestinal tracts of humans 
and other animals. Indeed, Clostridia are ubiquitous; they are commonly found in soil, dust, 
sewage, marine sediments, decaying vegetation, and mud. [See e.g., P.H.A. Sneath et al., 
"Clostridium," Bergey's Manual of Systematic Bacteriology, Vol. 2, pp. 1141-1200, 
Williams & Wilkins (1986).] Despite the identification of approximately 100 species of 
Clostridium, only a small number have been recognized as etiologic agents of medical and 
veterinary importance. Nonetheless, these species are associated with very serious diseases, 
including botulism, tetanus, anaerobic cellulitis, gas gangrene, bacteremia, pseudomembranous 
colitis, and clostridial gastroenteritis. Table 1 lists some of the species of medical and 
veterinary importance and the diseases with which they are associated. As virtually all of 
these species have been isolated from fecal samples of apparently healthy persons, some of 
these isolates may be transient, rather than permanent residents of the colonic flora. 



TABLE 1 

Clostridium Species Of Medical And Veterinary Importance* 

Species                      Disease 

C. aminovalericum            Bacteriuria (pregnant women) 
C. argentinense              Infected wounds; Bacteremia; Botulism; Infections of amniotic fluid 
C. baratii                   Infected war wounds; Peritonitis; Infectious processes of the eye, ear and prostate 
C. beijerinckii              Infected wounds 
C. bifermentans              Infected wounds; Abscesses; Gas gangrene; Bacteremia 
C. botulinum                 Food poisoning; Botulism (wound, food, infant) 
C. butyricum                 Urinary tract, lower respiratory tract, pleural cavity, and abdominal infections; Infected wounds; Abscesses; Bacteremia 
C. cadaveris                 Abscesses; Infected wounds 
C. carnis                    Soft tissue infections; Bacteremia 
C. chauvoei                  Blackleg 
C. clostridioforme           Abdominal, cervical, scrotal, pleural, and other infections; Septicemia; Peritonitis; Appendicitis 
C. cochlearium               Isolated from human disease processes, but role in disease unknown 
C. difficile                 Antimicrobial-associated diarrhea; Pseudomembranous enterocolitis; Bacteremia; Pyogenic infections 
C. fallax                    Soft tissue infections 
C. ghonii                    Soft tissue infections 
C. glycolicum                Wound infections; Abscesses; Peritonitis 
C. hastiforme                Infected war wounds; Bacteremia; Abscesses 
C. histolyticum              Infected war wounds; Gas gangrene; Gingival plaque isolate 
C. indolis                   Gastrointestinal tract infections 
C. innocuum                  Gastrointestinal tract infections; Empyema 
C. irregulare                Penile lesions 
C. leptum                    Isolated from human disease processes, but role in disease unknown 
C. limosum                   Bacteremia; Peritonitis; Pulmonary infections 
C. malenominatum             Various infectious processes 
C. novyi                     Infected wounds; Gas gangrene; Blackleg; Big head (ovine); Redwater disease (bovine) 
C. oroticum                  Urinary tract infections; Rectal abscesses 
C. paraputrificum            Bacteremia; Peritonitis; Infected wounds; Appendicitis 
C. perfringens               Gas gangrene; Anaerobic cellulitis; Intra-abdominal abscesses; Soft tissue infections; Food poisoning; Necrotizing pneumonia; Empyema; Meningitis; Bacteremia; Uterine infections; Enteritis necroticans; Lamb dysentery; Struck; Ovine enterotoxemia 
C. putrefaciens              Bacteriuria (pregnant women with bacteremia) 
C. putrificum                Abscesses; Infected wounds; Bacteremia 
C. ramosum                   Infections of the abdominal cavity, genital tract, lung, and biliary tract; Bacteremia 
C. sartagoforme              Isolated from human disease processes, but role in disease unknown 
C. septicum                  Gas gangrene; Bacteremia; Suppurative infections; Necrotizing enterocolitis; Braxy 
C. sordellii                 Gas gangrene; Wound infections; Penile lesions; Bacteremia; Abscesses; Abdominal and vaginal infections 
C. sphenoides                Appendicitis; Bacteremia; Bone and soft tissue infections; Intraperitoneal infections; Infected war wounds; Visceral gas gangrene; Renal abscesses 
C. sporogenes                Gas gangrene; Bacteremia; Endocarditis; Central nervous system and pleuropulmonary infections; Penile lesions; Infected war wounds; Other pyogenic infections 
C. subterminale              Bacteremia; Empyema; Biliary tract, soft tissue and bone infections 
C. symbiosum                 Liver abscesses; Bacteremia; Infections resulting due to bowel flora 
C. tertium                   Gas gangrene; Appendicitis; Brain abscesses; Intestinal tract and soft tissue infections; Infected war wounds; Periodontitis; Bacteremia 
C. tetani                    Tetanus; Infected gums and teeth; Corneal ulcerations; Mastoid and middle ear infections; Intraperitoneal infections; Tetanus neonatorum; Postpartum uterine infections; Soft tissue infections, especially related to trauma (including abrasions and lacerations); Infections related to use of contaminated needles 
C. thermosaccharolyticum     Isolated from human disease processes, but role in disease unknown 

* Compiled from P.G. Engelkirk et al., "Classification," Principles and Practice of Clinical Anaerobic 
Bacteriology, pp. 22-23, Star Publishing Co., Belmont, CA (1992); J. Stephen and R.A. Petrowski, 
"Toxins Which Traverse Membranes and Deregulate Cells," in Bacterial Toxins, 2d ed., pp. 66-67, 
American Society for Microbiology (1986); R. Berkow and A.J. Fletcher (eds.), "Bacterial Diseases," 
Merck Manual of Diagnosis and Therapy, 16th ed., pp. 116-126, Merck Research Laboratories, Rahway, 
N.J. (1992); and O.H. Sigmund and C.M. Fraser (eds.), "Clostridial Infections," Merck Veterinary 
Manual, 5th ed., pp. 396-409, Merck & Co., Rahway, N.J. (1979). 

In most cases, the pathogenicity of these organisms is related to the release of powerful 
exotoxins or highly destructive enzymes. Indeed, several species of the genus Clostridium 
produce toxins and other enzymes of great medical and veterinary significance. [C.L. 
Hatheway, Clin. Microbiol. Rev. 3:66-98 (1990).] 

Perhaps because of their significance for human and veterinary medicine, much 
research has been conducted on these toxins, in particular those of C. botulinum and 
C. difficile. 

C. botulinum 

Several strains of Clostridium botulinum produce toxins of significance to human and 
animal health. [C.L. Hatheway, Clin. Microbiol. Rev. 3:66-98 (1990).] The effects of these 
toxins range from diarrheal diseases that can cause destruction of the colon, to paralytic 
effects that can cause death. Particularly at risk for developing clostridial diseases are 
neonates and humans and animals in poor health (e.g., those suffering from diseases 
associated with old age or immunodeficiency diseases). 

Clostridium botulinum produces the most poisonous biological toxin known. The 
lethal human dose is a mere 10^-9 mg/kg bodyweight for toxin in the bloodstream. Botulinal 
toxin blocks nerve transmission to the muscles, resulting in flaccid paralysis. When the toxin 
reaches airway and respiratory muscles, it results in respiratory failure that can cause death. 
[S. Arnon, J. Infect. Dis. 154:201-206 (1986).] 

C. botulinum spores are carried by dust and are found on vegetables taken from the 
soil, on fresh fruits, and on agricultural products such as honey. Under conditions favorable 
to the organism, the spores germinate to vegetative cells which produce toxin. [S. Arnon, 
Ann. Rev. Med. 31:541 (1980).] 

Botulism disease may be grouped into four types, based on the method of introduction 
of toxin into the bloodstream. Food-borne botulism results from ingesting improperly 
preserved and inadequately heated food that contains botulinal toxin. There were 355 cases of 
food-borne botulism in the United States between 1976 and 1984. [K.L. MacDonald et al., 
Am. J. Epidemiol. 124:794 (1986).] The death rate due to botulinal toxin is 12% and can be 
higher in particular risk groups. [C.O. Tacket et al., Am. J. Med. 76:794 (1984).] 
Wound-induced botulism results from C. botulinum penetrating traumatized tissue and producing toxin 
that is absorbed into the bloodstream. Since 1950, thirty cases of wound botulism have been 
reported. [M.N. Swartz, "Anaerobic Spore-Forming Bacilli: The Clostridia," pp. 633-646, in 
B.D. Davis et al. (eds.), Microbiology, 4th edition, J.B. Lippincott Co. (1990).] Inhalation 
botulism results when the toxin is inhaled. Inhalation botulism has been reported as the result 
of accidental exposure in the laboratory [E. Holzer, Med. Klin. 41:1735 (1962)] and could 
arise if the toxin is used as an agent of biological warfare [D.R. Franz et al., in Botulinum 
and Tetanus Neurotoxins, B.R. DasGupta, ed., Plenum Press, New York (1993), pp. 473-476]. 
Infectious infant botulism results from C. botulinum colonization of the infant intestine with 
production of toxin and its absorption into the bloodstream. It is likely that the bacterium 
gains entry when spores are ingested and subsequently germinate. [S. Arnon, J. Infect. Dis. 
154:201 (1986).] There have been 500 cases reported since it was first recognized in 1976. 
[M.N. Swartz, supra.] 

Infant botulism strikes infants who are three weeks to eleven months old (greater than 
90% of the cases are infants less than six months). [S. Arnon, J. Infect. Dis. 154:201 (1986).] 
It is believed that infants are susceptible, due, in large part, to the absence of the full adult 
complement of intestinal microflora. The benign microflora present in the adult intestine 
provide an acidic environment that is not favorable to colonization by C. botulinum. Infants 
begin life with a sterile intestine which is gradually colonized by microflora. Because of the 
limited microflora present in early infancy, the intestinal environment is not as acidic, 
allowing for C. botulinum spore germination, growth, and toxin production. In this regard, 
some adults who have undergone antibiotic therapy which alters intestinal microflora become 
more susceptible to botulism. 

An additional factor accounting for infant susceptibility to infectious botulism is the 
immaturity of the infant immune system. The mature immune system is sensitized to 
bacterial antigens and produces protective antibodies. Secretory IgA produced in the adult 
intestine has the ability to agglutinate vegetative cells of C. botulinum. [S. Arnon, J. Infect. 
Dis. 154:201 (1986).] Secretory IgA may also act by preventing intestinal bacteria and their 
products from crossing the cells of the intestine. [S. Arnon, Epidemiol. Rev. 3:45 (1981).] 
The infant immune system is not primed to do this. 

Clinical symptoms of infant botulism range from mild paralysis, to moderate and 
severe paralysis requiring hospitalization, to fulminant paralysis, leading to sudden death. [S. 
Arnon, Epidemiol. Rev. 3:45 (1981).] 

The chief therapy for severe infant botulism is ventilatory assistance using a 
mechanical respirator and concurrent elimination of toxin and bacteria using cathartics, 
enemas, and gastric lavage. There were 68 hospitalizations in California for infant botulism 
in a single year with a total cost of over $4 million for treatment. [T.L. Frankovich and S. 
Arnon, West. J. Med. 154:103 (1991).] 

Different strains of Clostridium botulinum each produce antigenically distinct toxin 
designated by the letters A-G. Serotype A toxin has been implicated in 26% of the cases of 
food botulism; types B, E and F have also been implicated in a smaller percentage of the food 
botulism cases [H. Sugiyama, Microbiol. Rev. 44:419 (1980)]. Wound botulism has been 
reportedly caused by only types A or B toxins [H. Sugiyama, supra]. Nearly all cases of 
infant botulism have been caused by bacteria producing either type A or type B toxin. 
(Exceptionally, one New Mexico case was caused by Clostridium botulinum producing type F 
toxin and another by Clostridium botulinum producing a type B-type F hybrid.) [S. Arnon, 
Epidemiol. Rev. 3:45 (1981).] Type C toxin affects waterfowl, cattle, horses and mink. Type 
D toxin affects cattle, and type E toxin affects both humans and birds. 

A trivalent antitoxin derived from horse plasma is commercially available from 
Connaught Industries Ltd. as a therapy for toxin types A, B, and E. However, the antitoxin 
has several disadvantages. First, extremely large dosages must be injected intravenously 
and/or intramuscularly. Second, the antitoxin has serious side effects such as acute 
anaphylaxis which can lead to death, and serum sickness. Finally, the efficacy of the 
antitoxin is uncertain and the treatment is costly. [C.O. Tacket et al., Am. J. Med. 76:794 
(1984).] 

A heptavalent equine botulinal antitoxin which uses only the F(ab')2 portion of the 
antibody molecule has been tested by the United States Military. [M. Balady, USAMRDC 
Newsletter, p. 6 (1991).] This was raised against impure toxoids in those large animals and is 
not a high titer preparation. 

A pentavalent human antitoxin has been collected from immunized human subjects for 
use as a treatment for infant botulism. The supply of this antitoxin is limited and cannot be 
expected to meet the needs of all individuals stricken with botulism disease. In addition, 
collection of human sera must involve screening out HIV and other potentially serious human 
pathogens. [P.J. Schwarz and S.S. Arnon, Western J. Med. 156:197 (1992).] 

Infant botulism has been implicated as the cause of mortality in some cases of Sudden 
Infant Death Syndrome (SIDS, also known as crib death). SIDS is officially recognized as 
infant death that is sudden and unexpected and that remained unexplained despite complete 
post-mortem examination. The link of SIDS to infant botulism came when fecal or blood 
specimens taken at autopsy from SIDS infants were found to contain C. botulinum organisms 
and/or toxin in 3-4% of cases analyzed. [D.R. Peterson et al., Rev. Infect. Dis. 1:630 
(1979).] In contrast, only 1 of 160 healthy infants (0.6%) had C. botulinum organisms in the 
feces and no botulinal toxin. (S. Arnon et al., Lancet, pp. 1273-76, June 17, 1978.) 

In developed countries, SIDS is the number one cause of death in children between 
one month and one year old. (S. Arnon et al., Lancet, pp. 1273-77, June 17, 1978.) More 
children die from SIDS in the first year than from any other single cause of death in the first 
fourteen years of life. In the United States, there are 8,000-10,000 SIDS victims annually. 
Id. 

What is needed is an effective therapy against infant botulism that is free of dangerous 
side effects, is available in large supply at a reasonable price, and can be safely and gently 
delivered so that prophylactic application to infants is feasible. 

Immunization of subjects with toxin preparations has been done in an attempt to 
induce immunity against botulinal toxins. A C. botulinum vaccine comprising chemically 
inactivated (i.e., formaldehyde-treated) type A, B, C, D and E toxin is commercially available 
for human usage. However, this vaccine preparation has several disadvantages. First, the 
efficacy of this vaccine is variable (in particular, only 78% of recipients produce protective 
levels of anti-type B antibodies following administration of the primary series). Second, 
immunization is painful (deep subcutaneous inoculation is required for administration), with 
adverse reactions being common (moderate to severe local reactions occur in approximately 
6% of recipients upon initial injection; this number rises to approximately 11% of individuals 
who receive booster injections) [Informational Brochure for the Pentavalent (ABCDE) 
Botulinum Toxoid, Centers for Disease Control]. Third, preparation of the vaccine is 
dangerous as active toxin must be handled by laboratory workers. 

What is needed are safe and effective vaccine preparations for administration to those 
at risk of exposure to C. botulinum toxins. 

C. difficile 

C. difficile, an organism which gained its name due to difficulties encountered in its 
isolation, has recently been proven to be an etiologic agent of diarrheal disease. (Sneath et 
al., p. 1165.) C. difficile is present in the gastrointestinal tract of approximately 3% of 
healthy adults, and 10-30% of neonates without adverse effect (Swartz, at p. 644); by other 
estimates, C. difficile is a part of the normal gastrointestinal flora of 2-10% of humans. [G.F. 
Brooks et al. (eds.), "Infections Caused by Anaerobic Bacteria," Jawetz, Melnick & 
Adelberg's Medical Microbiology, 19th ed., pp. 257-262, Appleton & Lange, San Mateo, CA 
(1991).] As these organisms are relatively resistant to most commonly used antimicrobials, 
when a patient is treated with antibiotics, the other members of the normal gastrointestinal 
flora are suppressed and C. difficile flourishes, producing cytopathic toxins and enterotoxins. 
It has been found in 25% of cases of moderate diarrhea resulting from treatment with 
antibiotics, especially the cephalosporins, clindamycin, and ampicillin. [M.N. Swartz at 644.] 

WO 98/08540    PCT/US97/15394 

Importantly, C. difficile is commonly associated with nosocomial infections. The 
organism is often present in the hospital and nursing home environments and may be carried 
on the hands and clothing of hospital personnel who care for debilitated and 
immunocompromised patients. As many of these patients are being treated with 
antimicrobials or other chemotherapeutic agents, such transmission of C. difficile represents a 
significant risk factor for disease. (Engelkirk et al., pp. 64-67.) 

C. difficile is associated with a range of diarrhetic illness, ranging from diarrhea alone 
to marked diarrhea and necrosis of the gastrointestinal mucosa with the accumulation of 
inflammatory cells and fibrin, which forms a pseudomembrane in the affected area. (Brooks 
et al.) It has been found in over 95% of pseudomembranous enterocolitis cases. (Swartz, at 
p. 644.) This occasionally fatal disease is characterized by diarrhea, multiple small colonic 
plaques, and toxic megacolon. (Swartz, at p. 644.) Although stool cultures are sometimes 
used for diagnosis, diagnosis is best made by detection of the heat labile toxins present in 
fecal filtrates from patients with enterocolitis due to C. difficile. (Swartz, at p. 644-645; and 
Brooks et al., at p. 260.) C. difficile toxins are cytotoxic for tissue/cell cultures and cause 
enterocolitis when injected intracecally into hamsters. (Swartz, at p. 644.) 

The enterotoxicity of C. difficile is primarily due to the action of two toxins, 
designated A and B, each of approximately 300,000 in molecular weight. Both are potent 
cytotoxins, with toxin A possessing direct enterocytotoxic activity. [Lyerly et al., Infect. 
Immun. 60:4633 (1992).] Unlike toxin A of C. perfringens, an organism rarely associated 
with antimicrobial-associated diarrhea, the toxin of C. difficile is not a spore coat constituent 
and is not produced during sporulation. (Swartz, at p. 644.) C. difficile toxin A causes 
hemorrhage, fluid accumulation and mucosal damage in rabbit ileal loops and appears to 
increase the uptake of toxin B by the intestinal mucosa. Toxin B does not cause intestinal 
fluid accumulation, but it is 1000 times more toxic than toxin A to tissue culture cells and 
causes membrane damage. Although both toxins induce similar cellular effects such as actin 
disaggregation, differences in cell specificity occur. 

Both toxins are important in disease. [Borriello et al., Rev. Infect. Dis., 12(suppl. 
2):S185 (1990); Lyerly et al., Infect. Immun., 47:349 (1985); and Rolfe, Infect. Immun., 
59:1223 (1990).] Toxin A is thought to act first by binding to brush border receptors, 
destroying the outer mucosal layer, then allowing toxin B to gain access to the underlying 
tissue. These steps in pathogenesis would indicate that the production of neutralizing 
antibodies against toxin A may be sufficient in the prophylactic therapy of CDAD. However, 
antibodies against toxin B may be a necessary additional component for an effective 
therapeutic against later stage colonic disease. Indeed, it has been reported that animals 
require antibodies to both toxin A and toxin B to be completely protected against the disease. 
[Kim and Rolfe, Abstr. Ann. Meet. Am. Soc. Microbiol., 69:62 (1987).] 

C. difficile has also been reported to produce other toxins such as an enterotoxin 
different from toxins A and B [Banno et al., Rev. Infect. Dis., 6(Suppl. 1):S11-S20 (1984)], a 
low molecular weight toxin [Rihn et al., Biochem. Biophys. Res. Comm., 124:690-695 
(1984)], a motility altering factor [Justus et al., Gastroenterol., 83:836-843 (1982)], and 
perhaps other toxins. Regardless, C. difficile gastrointestinal disease is of primary concern. 

It is significant that due to its resistance to most commonly used antimicrobials, C. 
difficile is associated with antimicrobial therapy with virtually all antimicrobial agents 
(although most commonly ampicillin, clindamycin and cephalosporins). It is also associated 
with disease in patients undergoing chemotherapy with such compounds as methotrexate, 
5-fluorouracil, cyclophosphamide, and doxorubicin. [S.M. Finegold et al., Clinical Guide to 
Anaerobic Infections, pp. 88-89, Star Publishing Co., Belmont, CA (1992).] 

Treatment of C. difficile disease is problematic, given the high resistance of the 
organism. Oral metronidazole, bacitracin and vancomycin have been reported to be effective. 
(Finegold et al., p. 89.) However, there are problems associated with treatment utilizing these 
compounds. Vancomycin is very expensive, some patients are unable to take oral medication, 
and the relapse rate is high (20-25%), although it may not occur for several weeks. Id. 

C. difficile disease would be prevented or treated by neutralizing the effects of these 
toxins in the gastrointestinal tract. Thus, what is needed is an effective therapy against C. 
difficile toxin that is free of dangerous side effects, is available in large supply at a reasonable 
price, and can be safely delivered so that prophylactic application to patients at risk of 
developing pseudomembranous enterocolitis is feasible. 

DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the reactivity of anti-C. botulinum IgY by Western blot. 

Figure 2 shows the IgY antibody titer to C. botulinum type A toxoid in eggs, measured 
by ELISA. 

Figure 3 shows the results of C. difficile toxin A neutralization assays. 

Figure 4 shows the results of C. difficile toxin B neutralization assays. 

Figure 5 shows the results of C. difficile toxin B neutralization assays. 

Figure 6 is a restriction map of C. difficile toxin A gene, showing sequences of 
primers 1-4 (SEQ ID NOS:1-4). 

Figure 7 is a Western blot of C. difficile toxin A reactive protein. 

Figure 8 shows C. difficile toxin A expression constructs. 

Figure 9 shows C. difficile toxin A expression constructs. 

Figure 10 shows the purification of recombinant C. difficile toxin A. 

Figure 11 shows the results of C. difficile toxin A neutralization assays with antibodies 
reactive to recombinant toxin A. 

Figure 12 shows the results for a C. difficile toxin A neutralization plate. 

Figure 13 shows the results for a C. difficile toxin A neutralization plate. 

Figure 14 shows the results of recombinant C. difficile toxin A neutralization assays. 

Figure 15 shows C. difficile toxin A expression constructs. 

Figure 16 shows a chromatograph plotting absorbance at 280 nm against retention time 
for a pMA1870-680 IgY PEG preparation. 

Figure 17 shows two recombinant C. difficile toxin B expression constructs. 

Figure 18 shows C. difficile toxin B expression constructs. 

Figure 19 shows C. difficile toxin B expression constructs. 

Figure 20 shows C. difficile toxin B expression constructs. 

Figure 21 is an SDS-PAGE gel showing the purification of recombinant C. difficile 
toxin B fusion protein. 

Figure 22 is an SDS-PAGE gel showing the purification of two histidine-tagged 
recombinant C. difficile toxin B proteins. 

Figure 23 shows C. difficile toxin B expression constructs. 

Figure 24 is a Western blot of C. difficile toxin B reactive protein. 

Figure 25 shows C. botulinum type A toxin expression constructs; constructs used to 
provide C. botulinum or C. difficile sequences are also shown. 

Figure 26 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of recombinant C. botulinum type A toxin fusion proteins. 

Figure 27 shows C. botulinum type A toxin expression constructs; constructs used to 
provide C. botulinum sequences are also shown. 

Figure 28 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of pHisBot protein using the Ni-NTA resin. 

Figure 29 is an SDS-PAGE gel stained with Coomassie blue showing the expression of 
pHisBot protein in BL21(DE3) and BL21(DE3)pLysS host cells. 

Figure 30 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of pHisBot protein using a batch absorption procedure. 

Figure 31 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of pHisBot and pHisBot(native) proteins using a Ni-NTA column. 

Figure 32 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of pHisBotA protein expressed in pHisBotA(syn) kan lacIq T7/pACYCGro/BL21(DE3) cells 
using an IDA column. 

Figure 33 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of pHisBotA, pHisBotB and pHisBotE proteins by IDA chromatography followed by 
chromatography on S-100 to remove folding chaperones. 

Figure 34 is an SDS-PAGE gel stained with Coomassie blue showing the extracts 
derived from pHisBotB amp T7lac/BL21(DE3) cells before and after purification on a 
Ni-NTA column. 

Figure 35 is an SDS-PAGE gel run under native conditions and stained with 
Coomassie blue showing the removal of folding chaperones from IDA-purified BotB protein 
using a S-100 column. 

Figure 36 is an SDS-PAGE gel stained with Coomassie blue showing proteins that 
eluted during an imidazole step gradient applied to an IDA column containing a lysate of 
pHisBotB kan lacIq T7/pACYCGro/BL21(DE3) cells. 

Figure 37 is an SDS-PAGE gel run under native conditions and stained with 
Coomassie blue showing IDA-purified BotB protein before and after ultrafiltration. 

Figure 38 is an SDS-PAGE gel stained with Coomassie blue showing the purification 
of BotE protein using a NiNTA column. 

Figure 39 is an SDS-PAGE gel stained with Coomassie blue showing extracts derived 
from pHisBotA kan T7 lac/BL21(DE3) pLysS cells grown in fermentation culture. 

Figure 40 is a chromatogram showing proteins present after IDA-purified BotE protein 
was applied to a S-100 column. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

As used herein, the term "neutralizing" is used in reference to antitoxins, particularly 
antitoxins comprising antibodies, which have the ability to prevent the pathological actions of 
the toxin against which the antitoxin is directed. 

As used herein, the term "overproducing" is used in reference to the production of 
clostridial toxin polypeptides in a host cell and indicates that the host cell is producing more 
of the clostridial toxin by virtue of the introduction of nucleic acid sequences encoding said 
clostridial toxin polypeptide than would be expressed by said host cell absent the introduction 
of said nucleic acid sequences. To allow ease of purification of toxin polypeptides produced 
in a host cell it is preferred that the host cell express or overproduce said toxin polypeptide at 
a level greater than 1 mg/liter of host cell culture. 

"A host cell capable of expressing a recombinant protein at a level greater than or 
equal to 5% of the total cellular protein" is a host cell in which the recombinant protein 
represents at least 5% of the total cellular protein. To determine what percentage of total 
cellular protein the recombinant protein represents, the following steps are taken. A total of 
10 OD600 units of recombinant host cells (e.g., 200 µl of cells at OD600 = 50/ml) are removed 
(at a timepoint known to represent the peak of expression of the desired recombinant protein) 
to a 1.5 ml microfuge tube and pelleted for 2 min at maximum rpm in a microfuge. The 
pellets are resuspended in 1 ml of 50 mM NaHPO4, 0.5 M NaCl, 40 mM imidazole buffer 
(pH 6.8) containing 1 mg/ml lysozyme. The samples are incubated for 20 min at room 
temperature and stored overnight at -10°C. Samples are thawed completely at room temperature 
and sonicated 2 X 10 seconds with a Branson Sonifier 450 microtip probe at #3 power 
setting. The samples are centrifuged for 5 min at maximum rpm in a microfuge. An aliquot 
(20 µl) of the protein sample is removed to 20 µl of 2X sample buffer (this represents the total 
protein extract). The samples are heated to 95°C for 5 min, then cooled and 5 or 10 µl are 
loaded onto 12.5% SDS-PAGE gels. High molecular weight protein markers are also loaded 
to allow for estimation of the MW of identified recombinant proteins. After electrophoresis, 
protein is detected generally by staining with Coomassie blue and the stained gel is scanned 
using a densitometer to determine the percentage of protein present in each band. In this 
manner, the percentage of protein present in the band corresponding to the recombinant 
protein of interest may be determined. It is not necessary that Coomassie blue be employed 
for the detection of protein; a number of fluorescent dyes [e.g., Sypro orange S-6651 
(Molecular Probes, Eugene, OR)] may be employed and the stained gel scanned using a 
fluoroimager [e.g., Fluor Imager SI (Molecular Dynamics, Sunnyvale, CA)]. 
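
The sampling and densitometry arithmetic in the preceding definition can be sketched in Python. This is only an illustration under stated assumptions, not part of the disclosed procedure: the function names, the dictionary of band readings, and the example numbers are hypothetical, and band intensities are assumed to be background-corrected densitometer values in arbitrary units.

```python
from typing import Dict


def culture_volume_ul(od_units: float, od600_per_ml: float) -> float:
    """Microliters of culture that contain the requested number of OD600 units.

    One OD600 unit is taken here to mean 1 ml of culture at OD600 = 1
    (an assumption about the unit convention).
    """
    if od600_per_ml <= 0:
        raise ValueError("OD600 must be positive")
    return od_units * 1000.0 / od600_per_ml


def band_percentages(band_intensities: Dict[str, float]) -> Dict[str, float]:
    """Express each band as a percentage of the summed intensity of the lane."""
    total = sum(band_intensities.values())
    if total <= 0:
        raise ValueError("lane has no measurable signal")
    return {name: 100.0 * value / total for name, value in band_intensities.items()}


def meets_expression_threshold(band_intensities: Dict[str, float],
                               band_name: str,
                               threshold_percent: float = 5.0) -> bool:
    """True if the named band is at least threshold_percent of total lane protein."""
    return band_percentages(band_intensities)[band_name] >= threshold_percent


# Hypothetical lane: one recombinant band plus the pooled host-protein bands.
lane = {"recombinant": 12.0, "host_proteins": 188.0}
print(culture_volume_ul(10, 25))               # volume holding 10 OD600 units
print(band_percentages(lane)["recombinant"])   # share of the recombinant band
print(meets_expression_threshold(lane, "recombinant"))
```

With these made-up readings the recombinant band accounts for 12 of 200 intensity units, so it would satisfy the 5% criterion; a real determination depends on how the densitometer software integrates and background-corrects each band.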

"A host cell capable of expressing a recombinant protein as a soluble protein at a level 
greater than or equal to 0.25% of the total soluble cellular protein ' is a host cell in which the 
amount of soluble recombinant protein present represents at least 0.25% of the total cellular 

20 protein. As used herein "total soluble cellular protein" refers to a clarified PF.I lysale 

prepared as described in Example 3l(c)(iv). Briclly. cells are harvested lollovving induction 
of expression of recombinant protein <at a point of maximal expression). I he cells are 
resuspended in cell resuspension buffer (CRB: 50 mM NaPO^, 0.5 M NaCl. 40 mM 
imidazole. pH 6.8) to create a 20% cell suspension twet v^eighl of cells/volume of CRB) and 

25 ceil lysaies are prepared as described in Example 3l(c)(iv) {Lc. sonicaiion or homogcnization 
followed by cenlrifugationK I'he cell lysate is then flocculalcd utilizing polyethyleneimine 
{V\U) prior to cenlrifugation. PEl (a 2% solution in dH.O. pH 7.5 with IICI) is added to the 
cell lysate to a fuial concentration of 0.2%, and stirred for 20 min at room temperature prior 
10 cenlrifugation 1 8.500 rpm in .lAlO rotor (Beckman) for 30 minutes at 4°C'|. This ircaunenl 

30 / removes RNA. DNA and cell wall components, resulting in a clarified, low vi.scosity lysale 
("PEl clarified lysate"). The recombinant protein present in the PEl clarified lysale is then 
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purified by chromatography on an IDA column for his-taugcd proteins). 1 he amount of 
purified recombinant protein (/.c, the eluted protein) is divided by the concentration of 
protein present in the PHI clarified lysatc (typically 8 mg/ml when using a 20% cell 
suspension as the starting material) and multiplied by 100 to determine what percentage of 
total soluble cellular protein is comprised of the soluble recombinant protein (see Example 
3>b). 
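The percentage calculation described above can be sketched as follows. This is a minimal
illustration, not the procedure of the cited Example. It assumes that the amount of eluted
protein is compared against the total protein in the volume of PEI clarified lysate applied
to the column (volume multiplied by concentration), so that both quantities are in mg. The
function names and the lysate volume parameter are hypothetical; the 8 mg/ml default and
the 0.25% threshold are the values given in the text.

```python
def percent_of_total_soluble_protein(eluted_mg, lysate_volume_ml,
                                     lysate_conc_mg_per_ml=8.0):
    """Percentage of total soluble cellular protein represented by the
    purified (eluted) recombinant protein.

    Assumption: the lysate protein mass is volume x concentration, so the
    ratio is mg eluted / mg loaded. 8 mg/ml is the typical concentration the
    text quotes for a 20% cell suspension.
    """
    if lysate_volume_ml <= 0 or lysate_conc_mg_per_ml <= 0:
        raise ValueError("lysate volume and concentration must be positive")
    total_soluble_mg = lysate_volume_ml * lysate_conc_mg_per_ml
    return 100.0 * eluted_mg / total_soluble_mg


def meets_soluble_expression_threshold(percent, threshold=0.25):
    """True if the soluble recombinant protein is at least 0.25% of the total."""
    return percent >= threshold


if __name__ == "__main__":
    # Hypothetical run: 2 mg eluted from 100 ml of lysate at 8 mg/ml.
    pct = percent_of_total_soluble_protein(2.0, 100.0)
    print(pct, meets_soluble_expression_threshold(pct))
```

In this hypothetical run, 100 ml of lysate at 8 mg/ml holds 800 mg of soluble protein, so
2 mg of eluted protein sits exactly at the 0.25% threshold.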

As used herein, the term "fusion protein" refers to a chimeric protein containing the
protein of interest (i.e., C. botulinum toxin A, B, C, D, E, F, or G and fragments thereof)
joined to an exogenous protein fragment (the fusion partner which consists of a non-toxin
protein). The fusion partner may enhance solubility of the C. botulinum protein as expressed
in a host cell, may provide an affinity tag to allow purification of the recombinant fusion
protein from the host cell or culture supernatant, or both. If desired, the fusion partner may
be removed from the protein of interest (i.e., toxin protein or fragments thereof) prior to
immunization by a variety of enzymatic or chemical means known to the art.

As used herein the term "non-toxin protein" or "non-toxin protein sequence" refers to
that portion of a fusion protein which comprises a protein or protein sequence which is not
derived from a bacterial toxin protein.

The term "protein of interest" as used herein refers to the protein whose expression is
desired within the fusion protein. In a fusion protein the protein of interest will be joined or
fused with another protein or protein domain, the fusion partner, to allow for enhanced
stability of the protein of interest and/or ease of purification of the fusion protein.

As used herein, the term "maltose binding protein" refers to the maltose binding
protein of E. coli. A portion of the maltose binding protein may be added to a protein of
interest to generate a fusion protein; a portion of the maltose binding protein may merely
enhance the solubility of the resulting fusion protein when expressed in a bacterial host. On
the other hand, a portion of the maltose binding protein may allow affinity purification of the
fusion protein on an amylose resin.

As used herein, the term "poly-histidine tract" when used in reference to a fusion
protein refers to the presence of two to ten histidine residues at either the amino- or
carboxy-terminus of a protein of interest. A poly-histidine tract of six to ten residues is
preferred. The poly-histidine tract is also defined functionally as being a number of
consecutive histidine residues added to the protein of interest which allows the affinity
purification of the resulting fusion protein on a nickel-chelate or IDA column.

As used herein, the term "purified" or "to purify" refers to the removal of
contaminants from a sample. For example, antitoxins are purified by removal of
contaminating non-immunoglobulin proteins; they are also purified by the removal of
immunoglobulin that does not bind toxin. The removal of non-immunoglobulin proteins
and/or the removal of immunoglobulins that do not bind toxin results in an increase in the
percent of toxin-reactive immunoglobulins in the sample. In another example, recombinant
toxin polypeptides are expressed in bacterial host cells and the toxin polypeptides are purified
by the removal of host cell proteins; the percent of recombinant toxin polypeptides is thereby
increased in the sample. Additionally, the recombinant toxin polypeptides are purified by the
removal of host cell components such as lipopolysaccharide (e.g., endotoxin).

The term "recombinant DNA molecule" as used herein refers to a DNA molecule
which is comprised of segments of DNA joined together by means of molecular biological
techniques.

The term "recombinant protein" or "recombinant polypeptide" as used herein refers to
a protein molecule which is expressed from a recombinant DNA molecule.

The term "native protein" as used herein refers to a protein which is isolated from a
natural source as opposed to the production of a protein by recombinant means.

As used herein the term "portion" when in reference to a protein (as in "a portion of a
given protein") refers to fragments of that protein. The fragments may range in size from
four amino acid residues to the entire amino acid sequence minus one amino acid.

As used herein "soluble" when in reference to a protein produced by recombinant
DNA technology in a host cell is a protein which exists in solution in the cytoplasm of the
host cell; if the protein contains a signal sequence the soluble protein is exported to the
periplasmic space in bacterial hosts and is secreted into the culture medium in eucaryotic cells
capable of secretion or by bacterial hosts possessing the appropriate genes (i.e., the kil gene).
In contrast, an insoluble protein is one which exists in denatured form inside cytoplasmic
granules (called inclusion bodies) in the host cell. High level expression (i.e., greater than
10-20 mg recombinant protein/liter of bacterial culture) of recombinant proteins often results
in the expressed protein being found in inclusion bodies in the bacterial host cells. A soluble
protein is a protein which is not found in an inclusion body inside the host cell or is found
both in the cytoplasm and in inclusion bodies and in this case the protein may be present at
high or low levels in the cytoplasm.

A distinction is drawn between a soluble protein (i.e., a protein which when expressed
in a host cell is produced in a soluble form) and a "solubilized" protein. An insoluble
recombinant protein found inside an inclusion body may be solubilized (i.e., rendered into a
soluble form) by treating purified inclusion bodies with denaturants such as guanidine
hydrochloride, urea or sodium dodecyl sulfate (SDS). These denaturants must then be
removed from the solubilized protein preparation to allow the recovered protein to renature
(refold). Not all proteins will refold into an active conformation after solubilization in a
denaturant and removal of the denaturant. Many proteins precipitate upon removal of the
denaturant. SDS may be used to solubilize inclusion bodies and will maintain the proteins in
solution at low concentration. However, dialysis will not always remove all of the SDS (SDS
can form micelles which do not dialyze out); therefore, SDS-solubilized inclusion body
protein is soluble but not refolded.

A distinction is drawn between proteins which are soluble (i.e., dissolved) in a
solution devoid of significant amounts of ionic detergents (e.g., SDS) or denaturants (e.g.,
urea, guanidine hydrochloride) and proteins which exist as a suspension of insoluble protein
molecules dispersed within the solution. A soluble protein will not be removed from a
solution containing the protein by centrifugation using conditions sufficient to remove bacteria
present in a liquid medium (i.e., centrifugation at 12,000 x g for 4-5 minutes). For example,
to test whether two proteins, protein A and protein B, are soluble in solution, the two proteins
are placed into a solution selected from the group consisting of PBS-NaCl (PBS containing
0.5 M NaCl), PBS-NaCl containing 0.2% Tween 20, PBS, PBS containing 0.2% Tween 20,
PBS-C (PBS containing 2 mM CaCl2), PBS-C containing either 0.1 or 0.5% Tween 20,
PBS-C containing either 0.1 or 0.5% NP-40, PBS-C containing either 0.1 or 0.5% Triton X-100,
PBS-C containing 0.1% sodium deoxycholate. The mixture containing proteins A and B is
then centrifuged at 5000 x g for 5 minutes. The supernatant and pellet formed by
centrifugation are then assayed for the presence of protein A and B. If protein A is found in
the supernatant and not in the pellet [except for minor amounts (i.e., less than 10%) as a
result of trapping], protein A is said to be soluble in the solution tested. If the majority of
protein B is found in the pellet (i.e., greater than 90%), then protein B is said to exist as a
suspension in the solution tested.
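The centrifugation test above amounts to a simple classification rule, which can be sketched
as follows. This is an illustrative sketch only: the function name and the "indeterminate"
label for results falling between the two cutoffs are assumptions, since the text defines only
the less-than-10% and greater-than-90% cases. The input amounts may be in any unit, provided
the supernatant and pellet are measured in the same unit.

```python
def classify_solubility(supernatant_amount, pellet_amount):
    """Classify a protein from its distribution after centrifugation.

    Less than 10% in the pellet (attributed to trapping) -> "soluble".
    More than 90% in the pellet -> "suspension".
    Anything in between is not defined by the text; it is reported here as
    "indeterminate" (an assumption of this sketch).
    """
    if supernatant_amount < 0 or pellet_amount < 0:
        raise ValueError("amounts must be non-negative")
    total = supernatant_amount + pellet_amount
    if total <= 0:
        raise ValueError("no protein detected in either fraction")
    pellet_fraction = pellet_amount / total
    if pellet_fraction < 0.10:
        return "soluble"
    if pellet_fraction > 0.90:
        return "suspension"
    return "indeterminate"


if __name__ == "__main__":
    # Hypothetical band intensities for protein A and protein B.
    print(classify_solubility(95.0, 5.0))
    print(classify_solubility(5.0, 95.0))
```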

As used herein, the term "therapeutic amount" refers to that amount of antitoxin
required to neutralize the pathologic effects of one or more clostridial toxins in a subject.

The term "pyrogen" as used herein refers to a fever-producing substance. Pyrogens
may be endogenous to the host (e.g., prostaglandins) or may be exogenous compounds (e.g.,
bacterial endo- and exotoxins, nonbacterial compounds such as antigens and certain steroid
compounds, etc.). The presence of pyrogen in a pharmaceutical solution may be detected
using the U.S. Pharmacopeia (USP) rabbit fever test (United States Pharmacopeia, Vol. XXII
(1990) United States Pharmacopeial Convention, Rockville, MD, p. 151).

The term "endotoxin" as used herein refers to the high molecular weight complexes
associated with the outer membrane of gram-negative bacteria. Unpurified endotoxin contains
lipids, proteins and carbohydrates. Highly purified endotoxin does not contain protein and is
referred to as lipopolysaccharide (LPS). Because unpurified endotoxin is of concern in the
production of pharmaceutical compounds (e.g., proteins produced in E. coli using recombinant
DNA technology), the term endotoxin as used herein refers to unpurified endotoxin. Bacterial
endotoxin is a well known pyrogen.

As used herein, the term "endotoxin-free" when used in reference to a composition to
be administered parenterally (with the exception of intrathecal administration) to a host means
that the dose to be delivered contains less than 5 EU/kg body weight [FDA Guidelines for
Parenteral Drugs (December 1987)]. Assuming a weight of 70 kg for an adult human, the
dose must contain less than 350 EU to meet FDA Guidelines for parenteral administration.
Endotoxin levels are measured herein using the Limulus Amebocyte Lysate (LAL) test
(Limulus Amebocyte Lysate Pyrochrome™, Associates of Cape Cod, Inc., Woods Hole, MA).
To measure endotoxin levels in preparations of recombinant proteins, 0.5 ml of a solution
comprising 0.5 mg of purified recombinant protein in 50 mM NaPO4, pH 7.0, 0.3 M NaCl and
10% glycerol is used in the LAL assay according to the manufacturer's instructions for the
endpoint chromogenic without diazo-coupling method [the specific components of the buffer
containing recombinant protein to be analyzed in the LAL test are not important; any buffer
having a neutral pH may be employed (see for example, alternative buffers employed in
Examples 34, 40 and 45)]. Compositions containing less than or equal to 250 endotoxin
units (EU)/mg of purified recombinant protein are herein defined as "substantially
endotoxin-free." Preferably the composition contains less than or equal to 100, and most
preferably less than or equal to 60, EU/mg of purified recombinant protein. Typically,
administration of bacterial toxins or toxoids to adult humans for the purpose of vaccination
involves doses of about 10-500 µg protein/dose. Therefore, administration of 10-500 µg of a
purified recombinant protein to a 70 kg human, wherein said purified recombinant protein
preparation contains 60 EU/mg protein, results in the introduction of only 0.6 to 30 EU (i.e.,
0.2 to 8.6% of the maximum allowable endotoxin burden per parenteral dose). Administration
of 10-500 µg of a purified recombinant protein to a 70 kg human, wherein said purified
recombinant protein preparation contains 250 EU/mg protein, results in the introduction of
only 2.5 to 125 EU (i.e., 0.7 to 36% of the maximum allowable endotoxin burden per
parenteral dose).
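The endotoxin burden figures quoted above follow from simple arithmetic, sketched below as
a check. The function and constant names are illustrative and not part of the specification;
the 5 EU/kg limit, the 70 kg body weight, the 10-500 µg dose range and the 60 and 250 EU/mg
endotoxin levels are the values stated in the text.

```python
MAX_EU_PER_KG = 5.0  # parenteral limit stated in the text (EU per kg body weight)


def max_allowable_eu(body_weight_kg=70.0):
    """Maximum endotoxin per parenteral dose for the given body weight."""
    return MAX_EU_PER_KG * body_weight_kg


def endotoxin_dose_eu(protein_ug, eu_per_mg):
    """Endotoxin (EU) introduced by a protein dose given in micrograms."""
    return (protein_ug / 1000.0) * eu_per_mg


def percent_of_burden(protein_ug, eu_per_mg, body_weight_kg=70.0):
    """Dose endotoxin as a percentage of the maximum allowable burden."""
    return 100.0 * endotoxin_dose_eu(protein_ug, eu_per_mg) / max_allowable_eu(body_weight_kg)


if __name__ == "__main__":
    for eu_per_mg in (60.0, 250.0):
        for dose_ug in (10.0, 500.0):
            print(eu_per_mg, dose_ug,
                  endotoxin_dose_eu(dose_ug, eu_per_mg),
                  round(percent_of_burden(dose_ug, eu_per_mg), 1))
```

For a 70 kg adult the limit is 350 EU. At 60 EU/mg, doses of 10 and 500 µg carry 0.6 and
30 EU; at 250 EU/mg they carry 2.5 and 125 EU, consistent with the percentages given above
after rounding.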

The LAL test is accepted by the U.S. FDA as a means of detecting bacterial
endotoxins (21 C.F.R. §§ 660.100-105). Studies have shown that the LAL test is equivalent
or superior to the USP rabbit pyrogen test for the detection of endotoxin and thus the LAL
test can be used as a surrogate for pyrogenicity studies in animals [F.C. Pearson, Pyrogens:
endotoxins, LAL testing and depyrogenation, Marcel Dekker, New York (1985), pp. 150-155].
The FDA Bureau of Biologics accepts the LAL assay in place of the USP rabbit pyrogen test
so long as the LAL assay utilized is shown to be as sensitive as, or more sensitive than, the
rabbit test [Fed. Reg., 38, 26130].

The term "monovalent" when used in reference to a clostridial vaccine refers to a
vaccine which is capable of provoking an immune response in a host animal directed against a
single type of clostridial toxin. For example, if immunization of a host with C. botulinum
type A toxin vaccine induces antibodies in the immunized host which protect against a
challenge with type A toxin but not against challenge with type B, C, D, E, F or G toxins,
then the type A vaccine is said to be monovalent. In contrast, a "multivalent" vaccine
provokes an immune response in a host animal directed against several (i.e., more than one)
clostridial toxins. For example, if immunization of a host with a vaccine comprising C.
botulinum type A and B toxins induces the production of antibodies which protect the host
against a challenge with both type A and B toxin, the vaccine is said to be multivalent (in
particular, this hypothetical vaccine is bivalent).

As used herein the term "immunogenically-effective amount" refers to that amount of
an immunogen required to invoke the production of protective levels of antibodies in a host
upon vaccination.

The term "protective level", when used in reference to the level of antibodies induced
upon immunization of the host with an immunogen which comprises a bacterial toxin, means
a level of circulating antibodies sufficient to protect the host from challenge with a lethal dose
of the toxin.

As used herein the terms "protein" and "polypeptide" refer to compounds comprising
amino acids joined via peptide bonds and are used interchangeably.

The terms "toxin" and "neurotoxin" when used in reference to toxins produced by
members (i.e., species and strains) of the genus Clostridium are used interchangeably and
refer to the proteins which are poisonous to nerve tissue.

The term "receptor-binding domain" when used in reference to a C. botulinum toxin
refers to the carboxy-terminal portion of the heavy chain (HC or the C fragment) of the toxin
which is presumed to be responsible for the binding of the active toxin (i.e., the derivative
toxin comprising the H and L chains joined via disulfide bonds) to receptors on the surface of
synaptosomes. The receptor-binding domain for C. botulinum type A toxin is defined herein
as comprising amino acid residues 861 through 1296 of SEQ ID NO:28. The receptor-binding
domain for C. botulinum type B toxin is defined herein as comprising amino acid
residues 848 through 1291 of SEQ ID NO:40 (strain Eklund 17B). The receptor-binding
domain of C. botulinum type C1 toxin is defined herein as comprising amino acid residues
856 through 1291 of SEQ ID NO:60. The receptor-binding domain of C. botulinum type D
toxin is defined herein as comprising amino acid residues 852 through 1276 of SEQ ID
NO:66. The receptor-binding domain of C. botulinum type E toxin is defined herein as
comprising amino acid residues 835 through 1250 of SEQ ID NO:50 (Beluga strain). The
receptor-binding domain of C. botulinum type F toxin is defined herein as comprising amino
acid residues 853 through 1274 of SEQ ID NO:71. The receptor-binding domain of C.
botulinum type G toxin is defined herein as comprising amino acid residues 853 through 1297
of SEQ ID NO:77. Within a given serotype, small variations in the primary amino acid
sequence of the botulinal toxins isolated from different strains have been reported [Whelan et
al. (1992), supra and Minton (1995) Curr. Top. Microbiol. Immunol. 195:161-194]. The
present invention contemplates fusion proteins comprising the receptor-binding domain of C.
botulinum toxins from serotypes A-G including the variants found among different strains
within a given serotype. The receptor-binding domains listed above are used as the prototype
for each strain within a serotype. Fusion proteins containing an analogous region from a
strain other than the prototype strain are encompassed by the present invention.

Fusion proteins comprising the receptor binding domain (i.e., C fragment) of botulinal
toxins may include amino acid residues located beyond the termini of the domains defined
above. For example, the pHisBotB protein contains amino acid residues 846-1291 of SEQ ID
NO:40; this fusion protein thus comprises the receptor-binding domain for C. botulinum type
B toxin as defined above (i.e., Ile-848 through Glu-1291). Similarly, pHisBotE contains
amino acid residues 827-1252 of SEQ ID NO:50 and pHisBotG contains amino acid residues
851-1297 of SEQ ID NO:77. Thus, both pHisBotB and pHisBotG fusion proteins contain a
few amino acids located beyond the N-terminus of the defined receptor-binding domain.
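The relationship between a construct and the defined receptor-binding domain can be checked
with a short residue-range comparison, sketched below. The dictionary layout and the function
name are illustrative assumptions. The residue numbers are those stated in the text for the
type B, E and G toxins, and the construct names are given as they are read from the text.

```python
# Receptor-binding domains as defined in the text: serotype -> (first, last residue).
DOMAINS = {"B": (848, 1291), "E": (835, 1250), "G": (853, 1297)}

# Constructs as described in the text: name -> (serotype, first, last residue).
CONSTRUCTS = {
    "pHisBotB": ("B", 846, 1291),
    "pHisBotE": ("E", 827, 1252),
    "pHisBotG": ("G", 851, 1297),
}


def flanking_residues(construct_name):
    """Return (extra N-terminal residues, extra C-terminal residues).

    Raises ValueError if the construct does not span the whole defined domain.
    """
    serotype, start, end = CONSTRUCTS[construct_name]
    domain_start, domain_end = DOMAINS[serotype]
    if start > domain_start or end < domain_end:
        raise ValueError(f"{construct_name} does not cover the defined domain")
    return domain_start - start, end - domain_end


if __name__ == "__main__":
    for name in CONSTRUCTS:
        print(name, flanking_residues(name))
```

With the numbers given in the text, each construct spans its defined domain and carries a few
additional residues at the N-terminus.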

The terms "native gene" or "native gene sequences" are used to indicate DNA
sequences encoding a particular gene which contain the same DNA sequences as found in the
gene as isolated from nature. In contrast, "synthetic gene sequences" are DNA sequences
which are used to replace the naturally occurring DNA sequences when the naturally
occurring sequences cause expression problems in a given host cell. For example,
naturally-occurring DNA sequences encoding codons which are rarely used in a host cell may be
replaced (e.g., by site-directed mutagenesis) such that the synthetic DNA sequence represents
a more frequently used codon. The native DNA sequence and the synthetic DNA sequence
will preferably encode the same amino acid sequence.

SUMMARY OF THE INVENTION

The present invention relates to the production of polypeptides derived from toxins,
particularly in recombinant host cells. In one embodiment, the present invention provides a
host cell containing a recombinant expression vector, said vector encoding a protein
comprising at least a portion of a Clostridium botulinum toxin, said toxin selected from the
group consisting of type B toxin and type E toxin. The present invention is not limited by
the nature of sequences encoding portions of the C. botulinum toxin. These sequences may be
derived from the native gene sequences or alternatively they may comprise synthetic gene
sequences. Synthetic gene sequences are employed when expression of the native gene
sequences is problematic in a given host cell (e.g., when the native gene sequences contain
sequences resembling yeast transcription termination signals and the desired host cell is a
yeast cell).

In one embodiment, the host cell is capable of expressing the recombinant C.
botulinum toxin protein at a level greater than or equal to 2% to 40% of the total cellular
protein and preferably at a level greater than or equal to 5% of the total cellular protein. In
another embodiment, the host cell is capable of expressing the recombinant C. botulinum
toxin protein as a soluble protein at a level greater than or equal to 0.25% of the total cellular
protein and preferably at a level greater than or equal to 0.25% to 10% of the total cellular
protein.

The present invention is not limited by the nature of the host cell employed for the
production of recombinant C. botulinum toxin proteins. In a preferred embodiment, the host
cell is an E. coli cell. In another preferred embodiment, the host cell is an insect cell;
particularly preferred insect host cells are Spodoptera frugiperda (Sf9) cells. In another
preferred embodiment, the host cell is a yeast cell; particularly preferred yeast cells are Pichia
pastoris cells.

In another embodiment, the invention provides a host cell containing a recombinant
expression vector, said vector encoding a fusion protein comprising a non-toxin protein
sequence and at least a portion of a Clostridium botulinum toxin, said toxin selected from the
group consisting of type B toxin and type E toxin. The invention is not limited by the nature
of the portion of the Clostridium botulinum toxin selected. In a preferred embodiment, the
portion of the toxin comprises the receptor binding domain (i.e., the C fragment of the toxin).
The present invention is not limited by the nature of the non-toxin protein sequence
employed. In a preferred embodiment, the non-toxin protein sequence comprises a
poly-histidine tract. A number of alternative fusion tags or fusion partners are known to the art
(e.g., MBP, GST, protein A, etc.) and may be employed for the production of fusion proteins
comprising a portion of a botulinal toxin.




The present invention further provides a vaccine comprising a fusion protein, said
fusion protein comprising a non-toxin protein sequence and at least a portion of a Clostridium
botulinum toxin, said toxin selected from the group consisting of type B toxin and type E
toxin. The vaccine may be a monovalent vaccine (i.e., containing only a toxin B fusion
protein or a toxin E fusion protein), a bivalent vaccine (i.e., containing both a toxin B fusion
protein and a toxin E fusion protein) or a trivalent or higher valency vaccine. In a preferred
embodiment, the toxin B fusion protein and/or toxin E fusion protein is combined with a
fusion protein comprising a non-toxin protein sequence and at least a portion of Clostridium
botulinum type A toxin. The present invention is not limited by the nature of the portion of
the Clostridium botulinum toxin selected. In a preferred embodiment, the portion of the toxin
comprises the receptor binding domain (i.e., the C fragment of the toxin). The present
invention is not limited by the nature of the non-toxin protein sequence employed. In a
preferred embodiment, the non-toxin protein sequence comprises a poly-histidine tract. A
number of alternative fusion tags or fusion partners are known to the art (e.g., MBP, GST,
protein A, etc.) and may be employed for the generation of fusion proteins comprising
vaccines. When a fusion partner (i.e., the non-toxin protein sequence) is employed for the
production of a recombinant C. botulinal toxin protein, the fusion partner may be removed
from the recombinant C. botulinal toxin protein if desired (i.e., prior to administration of the
protein to a subject) using a variety of methods known to the art (e.g., digestion of fusion
proteins containing Factor Xa or thrombin recognition sites with the appropriate enzyme). A
number of the pETHis vectors employed herein provide an N-terminal his-tag followed by a
Factor Xa cleavage site (see Example 28a); the botulinal C fragment sequences follow the
Factor Xa site and thus, Factor Xa can be used to remove the his-tag from the botulinal fusion
protein. In a preferred embodiment, the vaccine is substantially endotoxin-free.

The present invention is not limited by the method employed for the generation of
vaccine comprising fusion proteins comprising a non-toxin protein sequence and at least a
portion of a Clostridium botulinum toxin. The fusion proteins may be produced by
recombinant DNA means using either native or synthetic gene sequences expressed in a host
cell. The present invention is not limited to the production of vaccines using recombinant
host cells; cell free in vitro transcription/translation systems may be employed for the
expression of the nucleic acid constructs encoding the fusion proteins of the present invention.
An example of such a cell-free system is the commercially available TnT™ Coupled
Reticulocyte Lysate System (Promega Corporation, Madison, WI). Alternatively, the fusion
proteins of the present invention may be generated by synthetic means (i.e., peptide
synthesis).

The present invention further provides a method of generating antibody directed
against a Clostridium botulinum toxin comprising: a) providing in any order: i) an antigen
comprising a fusion protein comprising a non-toxin protein sequence and at least a portion of
a Clostridium botulinum toxin, said toxin selected from the group consisting of type B toxin
and type E toxin, and ii) a host; and b) immunizing the host with the antigen so as to
generate an antibody. In a preferred embodiment, the antigen used to immunize the host also
contains a fusion protein comprising a non-toxin protein sequence and at least a portion of
Clostridium botulinum type A toxin. The present invention is not limited by the nature of the
portion of the Clostridium botulinum toxin selected. In a preferred embodiment, the portion
of the toxin comprises the receptor binding domain (i.e., the C fragment of the toxin). The
present invention is not limited by the nature of the non-toxin protein sequence employed. In
a preferred embodiment, the non-toxin protein sequence comprises a poly-histidine tract. A
number of alternative fusion tags or fusion partners are known to the art (e.g., MBP, GST,
protein A, etc.) and may be employed for the generation of fusion proteins comprising
vaccines. When a fusion partner (i.e., the non-toxin protein sequence) is employed for the
production of a recombinant C. botulinal toxin protein, the fusion partner may be removed
from the recombinant C. botulinal toxin protein if desired (i.e., prior to administration of the
protein to a subject) using a variety of methods known to the art (e.g., digestion of fusion
proteins containing Factor Xa or thrombin recognition sites with the appropriate enzyme).

The present invention is not limited by the nature of the host employed for the
production of the antibodies of the invention. In a preferred embodiment, the host is a
mammal, preferably a human. The antibodies of the present invention may be generated
using non-mammalian hosts such as birds, preferably chickens. In a preferred embodiment
the method of the present invention further comprises the step c) of collecting the antibodies
from the host. In yet another embodiment, the method of the present invention further
comprises the step d) of purifying the antibodies.

The present invention further provides antibodies raised according to the above
methods.

The present invention further contemplates multivalent vaccines comprising at least
two recombinant C. botulinum toxin proteins derived from the group consisting of C.
botulinum serotypes A, B, C, D, E, F, and G. The invention contemplates bivalent, trivalent,
quadravalent, pentavalent, heptavalent and septivalent vaccines comprising recombinant C.
botulinum toxin proteins. Preferably the recombinant C. botulinum toxin protein comprises
the receptor binding domain (i.e., C fragment) of the toxin.

DESCRIPTION OF THE INVENTION 

The present invention contemplates vaccinating humans and other animals with
polypeptides derived from C. botulinum neurotoxins which are substantially endotoxin-free.
These botulinal peptides are also useful for the production of antitoxin. Anti-botulinal toxin
antitoxin is useful for the treatment of patients affected by or at risk of symptoms due to the
action of C. botulinum toxins. The organisms, toxins and individual steps of the present
invention are described separately below.

I. Clostridium Species, Clostridial Diseases And Associated Toxins

A preferred embodiment of the method of the present invention is directed toward
obtaining antibodies against Clostridium species, their toxins, enzymes or other metabolic
by-products, cell wall components, or synthetic or recombinant versions of any of these
compounds. It is contemplated that these antibodies will be produced by immunization of
humans or other animals. It is not intended that the present invention be limited to any
particular toxin or any species of organism. In one embodiment, toxins from all Clostridium
species are contemplated as immunogens. Examples of these toxins include the neuraminidase
toxin of C. butyricum, C. sordellii toxins HT and LT, toxins A, B, C, D, E, F, and G of C.
botulinum and the numerous C. perfringens toxins. In one preferred embodiment, toxins A,
B, and F of C. botulinum are contemplated as immunogens. Table 2 lists various
Clostridium species, their toxins and some antigens associated with disease.

TABLE 2

Clostridial Toxins

Organism                         Toxins and Disease-Associated Antigens

C. botulinum                     A, B, C1, C2, D, E, F, G
C. butyricum                     Neuraminidase
C. difficile                     A, B, Enterotoxin (not A nor B), Motility Altering Factor,
                                 Low Molecular Weight Toxin, Others
C. perfringens                   α, β, ε, ι, γ, δ, ν, θ, κ, λ, μ, υ
C. sordellii / C. bifermentans   HT, LT, α, β, γ
C. novyi                         α, β, γ, δ, ε, ζ, ν, θ
C. septicum                      α, β, γ, δ
C. histolyticum                  α, β, γ, δ, ε plus additional enzymes
C. chauvoei                      α, β, γ, δ


It is not intended that antibodies produced against one toxin will only be used against
that toxin. It is contemplated that antibodies directed against one toxin (e.g., C. perfringens
type A enterotoxin) may be used as an effective therapeutic against one or more toxin(s)
produced by other members of the genus Clostridium or other toxin producing organisms
(e.g., Bacillus cereus, Staphylococcus aureus, Streptococcus mutans, Acinetobacter
calcoaceticus, Pseudomonas aeruginosa, other Pseudomonas species, etc.). It is further
contemplated that antibodies directed against the portion of the toxin which binds to
mammalian membranes (e.g., C. perfringens enterotoxin A) can also be used against other
organisms. It is contemplated that these membrane binding domains are produced
synthetically and used as immunogens.

II. Obtaining Antibodies In Non-Mammals 

A preferred embodiment of the method of the present invention for obtaining
antibodies involves immunization. However, it is also contemplated that antibodies could be
obtained from non-mammals without immunization. In the case where no immunization is
contemplated, the present invention may use non-mammals with preexisting antibodies to
toxins as well as non-mammals that have antibodies to whole organisms by virtue of reactions
with the administered antigen. An example of the latter involves immunization with synthetic
peptides or recombinant proteins sharing epitopes with whole organism components.

In a preferred embodiment, the method of the present invention contemplates
immunizing non-mammals with bacterial toxin(s). It is not intended that the present invention
be limited to any particular toxin. In one embodiment, toxin from all clostridial bacteria
sources (see Table 2) are contemplated as immunogens. Examples of these toxins are C.
butyricum neuraminidase toxin, toxins A, C, E, F and G from C. botulinum,
C. perfringens toxins α, β, ε, and ι, and C. sordellii toxins HT and LT. In a preferred
embodiment, C. botulinum toxins A, B, C, D, E, and F (or fragments thereof) are
contemplated as immunogens.

A particularly preferred embodiment involves the use of bacterial toxin protein or
fragments of toxin proteins produced by molecular biological means (i.e., recombinant toxin
proteins). In a preferred embodiment, the immunogen comprises the receptor-binding domain
(i.e., the ~50 kD carboxy-terminal portion of the heavy chain; also referred to as the C
fragment) of C. botulinum serotype A neurotoxin produced by recombinant DNA technology.
In another preferred embodiment, the immunogen comprises the receptor-binding domain of
C. botulinum serotype B neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype E neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype C1 neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype C2 neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype D neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype F neurotoxin produced by recombinant DNA technology. In yet
another preferred embodiment, the immunogen comprises the receptor-binding domain region
of C. botulinum serotype G neurotoxin produced by recombinant DNA technology. In a
preferred embodiment, the recombinant botulinal toxin proteins are expressed as fusion
proteins (e.g., as histidine-tagged proteins). In a still further preferred embodiment, the
immunogen is a multivalent vaccine comprising the receptor-binding domain region of C.
botulinum toxin from two or more toxins selected from the group consisting of type A, type
B, type C (including C1 and C2), type D, type E, and type F toxin.

When immunization is used, the preferred non-mammal is from the class Aves. All
birds are contemplated (e.g., duck, ostrich, emu, turkey, etc.). A preferred bird is a chicken.
Importantly, chicken antibody does not fix mammalian complement. [See H.N. Benson et al.,
J. Immunol. 87:616.] Thus, chicken antibody will normally not cause a complement-dependent
reaction. [A.A. Benedict and K. Yamaga, "Immunoglobulins and Antibody
Production in Avian Species," in Comparative Immunology (J.J. Marchaloni, ed.), pp. 335-375,
Blackwell, Oxford (1966).] Thus, the preferred antitoxins of the present invention will
not exhibit complement-related side effects observed with antitoxins known presently.

When birds are used, it is contemplated that the antibody will be obtained from either
the bird serum or the egg. A preferred embodiment involves collection of the antibody from
the egg. Laying hens transport immunoglobulin to the egg yolk ("IgY") in concentrations
equal to or exceeding that found in serum. [See R. Patterson et al., J. Immunol. 89:272
(1962); and S.B. Carroll and B.D. Stollar, J. Biol. Chem. 258:24 (1983).] In addition, the
large volume of egg yolk produced vastly exceeds the volume of serum that can be safely
obtained from the bird over any given time period. Finally, the antibody from eggs is purer
and more homogeneous; there is far less non-immunoglobulin protein (as compared to serum)
and only one class of immunoglobulin is transported to the yolk.

When considering immunization with toxins, one may consider modification of the
toxins to reduce the toxicity. In this regard, it is not intended that the present invention be
limited by immunization with modified toxin. Unmodified ("native") toxin is also
contemplated as an immunogen.

It is also not intended that the present invention be limited by the type of modification,
if modification is used. The present invention contemplates all types of toxin modification,
including chemical and heat treatment of the toxin. The preferred modification, however, is
formaldehyde treatment.

It is not intended that the present invention be limited to a particular mode of
immunization; the present invention contemplates all modes of immunization, including
subcutaneous, intramuscular, intraperitoneal, and intravenous or intravascular injection, as well
as per os administration of immunogen.

The present invention further contemplates immunization with or without adjuvant.
(Adjuvant is defined as a substance known to increase the immune response to other antigens
when administered with other antigens.) If adjuvant is used, it is not intended that the present
invention be limited to any particular type of adjuvant, or that the same adjuvant, once used,
be used all the time. While the present invention contemplates all types of adjuvant, whether
used separately or in combinations, the preferred use of adjuvant is the use of Complete
Freund's Adjuvant followed sometime later with Incomplete Freund's Adjuvant. Another
preferred use of adjuvant is the use of Gerbu Adjuvant. The invention also contemplates the
use of RIBI fowl adjuvant and Quil A adjuvant.

When immunization is used, the present invention contemplates a wide variety of
immunization schedules. In one embodiment, a chicken is administered toxin(s) on day zero
and subsequently receives toxin(s) in intervals thereafter. It is not intended that the present
invention be limited by the particular intervals or doses. Similarly, it is not intended that the
present invention be limited to any particular schedule for collecting antibody. The preferred
collection time is sometime after day 100.

Where birds are used and collection of antibody is performed by collecting eggs, the
eggs may be stored prior to processing for antibody. It is preferred that eggs be stored at 4°C
for less than one year.

It is contemplated that chicken antibody produced in this manner can be buffer-extracted
and used analytically. While unpurified, this preparation can serve as a reference
for activity of the antibody prior to further manipulations (e.g., immunoaffinity purification).

III. Increasing The Effectiveness Of Antibodies

When purification is used, the present invention contemplates purifying to increase the
effectiveness of both non-mammalian antitoxins and mammalian antitoxins. Specifically, the
present invention contemplates increasing the percent of toxin-reactive immunoglobulin. The
preferred purification approach for avian antibody is polyethylene glycol (PEG) separation.

The present invention contemplates that avian antibody be initially purified using
simple, inexpensive procedures. In one embodiment, chicken antibody from eggs is purified
by extraction and precipitation with PEG. PEG purification exploits the differential solubility
of lipids (which are abundant in egg yolks) and yolk proteins in high concentrations of PEG
8000. [Polson et al., Immunol. Comm. 9:495 (1980).] The technique is rapid, simple, and
relatively inexpensive and yields an immunoglobulin fraction that is significantly purer in
terms of contaminating non-immunoglobulin proteins than the comparable ammonium sulfate
fractions of mammalian sera and horse antibodies. The majority of the PEG is removed from
the precipitated chicken immunoglobulin by treatment with ethanol. Indeed, PEG-purified
antibody is sufficiently pure that the present invention contemplates the use of PEG-purified
antitoxins in the passive immunization of intoxicated humans and animals.

IV. Treatment 

The present invention contemplates antitoxin therapy for humans and other animals
intoxicated by bacterial toxins. A preferred method of treatment is by intravenous
administration of anti-botulinal antitoxin; oral administration is also contemplated for other
clostridial antitoxins.

A. Dosage Of Antitoxin

It was noted by way of background that a balance must be struck when administering
currently available antitoxin which is usually produced in large animals such as horses;
sufficient antitoxin must be administered to neutralize the toxin, but not so much antitoxin as
to increase the risk of untoward side effects. These side effects are caused by: i) patient
sensitivity to foreign (e.g., horse) proteins; ii) anaphylactic or immunogenic properties of
non-immunoglobulin proteins; iii) the complement fixing properties of mammalian antibodies;
and/or iv) the overall burden of foreign protein administered. It is extremely difficult to
strike this balance when, as noted above, the degree of intoxication (and hence the level of
antitoxin therapy needed) can only be approximated.

The present invention contemplates significantly reducing side effects so that this
balance is more easily achieved. Treatment according to the present invention contemplates
reducing side effects by using PEG-purified antitoxin from birds.

In one embodiment, the treatment of the present invention contemplates the use of
PEG-purified antitoxin from birds. The use of yolk-derived, PEG-purified antibody as
antitoxin allows for the administration of: 1) non(mammalian)-complement-fixing, avian
antibody; 2) a less heterogeneous mixture of non-immunoglobulin proteins; and 3) less total
protein to deliver the equivalent weight of active antibody present in currently available
antitoxins. The non-mammalian source of the antitoxin makes it useful for treating patients
who are sensitive to horse or other mammalian sera.

B. Delivery Of Antitoxin

Although it is not intended to limit the route of delivery, the present invention
contemplates a method for antitoxin treatment of bacterial intoxication in which delivery of
antitoxin is oral. In one embodiment, antitoxin is delivered in a solid form (e.g., tablets). In
an alternative embodiment antitoxin is delivered in an aqueous solution. When an aqueous
solution is used, the solution has sufficient ionic strength to solubilize antibody protein, yet is
made palatable for oral administration. The delivery solution may also be buffered (e.g.,
carbonate buffer pH 9.5) which can neutralize stomach acids and stabilize the antibodies when
the antibodies are administered orally. In one embodiment the delivery solution is an aqueous
solution. In another embodiment the delivery solution is a nutritional formula. Preferably,
the delivery solution is infant formula. Yet another embodiment contemplates the delivery of
lyophilized antibody encapsulated or microencapsulated inside acid-resistant compounds.

Methods of applying enteric coatings to pharmaceutical compounds are well known to
the art [companies specializing in the coating of pharmaceutical compounds are available; for
example, The Coating Place (Verona, WI) and AAI (Wilmington, NC)]. Enteric coatings
which are resistant to gastric fluid and whose release (i.e., dissolution of the coating to release
the pharmaceutical compound) is pH dependent are commercially available [for example, the
polymethacrylates Eudragit® L and Eudragit® S (Rohm GmbH)]. Eudragit® S is soluble in
intestinal fluid from pH 7.0; this coating can be used to microencapsulate lyophilized antitoxin
antibodies and the particles are suspended in a solution having a pH above or below pH 7.0
for oral administration. The microparticles will remain intact and undissolved until they
reach the intestines, where the intestinal pH would cause them to dissolve, thereby releasing
the antitoxin.

The invention contemplates a method of treatment which can be administered for
treatment of acute intoxication. In one embodiment, antitoxin is administered orally in either
a delivery solution or in tablet form, in therapeutic dosage, to a subject intoxicated by the
bacterial toxin which served as immunogen for the antitoxin.

The invention also contemplates a method of treatment which can be administered
prophylactically. In one embodiment, antitoxin is administered orally, in a delivery solution,
in therapeutic dosage, to a subject, to prevent intoxication of the subject by the bacterial toxin
which served as immunogen for the production of antitoxin. In another embodiment,
antitoxin is administered orally in solid form such as tablets or as microencapsulated particles.
Microencapsulation of lyophilized antibody using compounds such as Eudragit® (Rohm
GmbH) or polyethylene glycol, which dissolve at a wide range of pH units, allows the oral
administration of solid antitoxin in a liquid form (i.e., a suspension) to recipients unable to
tolerate administration of tablets (e.g., children or patients on feeding tubes). In one preferred
embodiment the subject is a child. In another embodiment, antibody raised against whole
bacterial organism is administered orally to a subject, in a delivery solution, in therapeutic
dosage.

V. Vaccines Against Clostridial Species 

The invention contemplates the generation of mono- and multivalent vaccines for the
protection of an animal (particularly humans) against several clostridial species. Of particular
interest are vaccines which stimulate the production of a humoral immune response to C.
botulinum, C. tetani and C. difficile in humans. The antigens comprising the vaccine
preparation may be native or recombinantly produced toxin proteins from the clostridial
species listed above. When toxin proteins are used as immunogens they are generally
modified to reduce the toxicity. This modification may be by chemical or genetic (i.e.,
recombinant DNA technology) means. In general genetic detoxification (i.e., the expression
of nontoxic fragments in a host cell) is preferred as the expression of nontoxic fragments in a
host cell precludes the presence of intact, active toxin in the final preparation. However,
when chemical modification is desired, the preferred toxin modification is formaldehyde
treatment.

The invention contemplates that recombinant C. botulinum toxin proteins be used as
antigens in mono- and multivalent vaccine preparations. Soluble, substantially endotoxin-free
recombinant C. botulinum toxin proteins derived from serotypes A, B and E may be used
individually (i.e., as mono-valent vaccines) or in combination (i.e., as a multi-valent vaccine).
In addition, the recombinant C. botulinum toxin proteins derived from serotypes A, B and E
may be used in conjunction with either recombinant or native toxins or toxoids from other
serotypes of C. botulinum, C. difficile and C. tetani as antigens for the preparation of these
mono- and multivalent vaccines. It is contemplated that, due to the structural similarity of C.
botulinum and C. tetani toxin proteins, a vaccine comprising C. difficile and C. botulinum toxin
proteins (native or recombinant or a mixture thereof) be used to stimulate an immune
response against C. botulinum, C. tetani and C. difficile.

The present invention further contemplates multi-valent vaccines comprising two or
more botulinal toxin proteins selected from the group comprising recombinant C. botulinum
toxin proteins derived from serotypes A, B, C (including C1 and C2), D, E, F and G.

The adverse consequences of exposure to botulinal toxin would be avoided by
immunization of subjects at risk of exposure to the toxin with nontoxic preparations which
confer immunity such as chemically or genetically detoxified toxin.

Vaccines which confer immunity against one or more of the toxin types A, B, E, F
and G would be useful as a means of protecting humans from the deleterious effects of those
C. botulinum toxins known to affect man. Indeed as the possibility exists that humans could
be exposed to any of the seven serotypes of C. botulinum toxin (e.g., during biological
warfare or the production of toxin in a laboratory setting), multivalent vaccines capable of
conferring immunity against toxin types A-G (including both C1 and C2 toxins) would be
useful for the protection of humans. Vaccines which confer immunity against one or more of
the toxin types C, D and E would be useful for veterinary applications.

The botulinal neurotoxin is synthesized as a single polypeptide chain which is
processed into a heavy (H; ~100 kD) and a light (L; ~50 kD) chain by cleavage with proteolytic
enzymes; these two chains are held together via disulfide bonds in the active toxin (referred to
as derivative toxin) [B.R. DasGupta and H. Sugiyama, Biochem. Biophys. Res. Commun.
48:108 (1972); reviewed in B.R. DasGupta, J. Physiol. 84:220 (1990), H. Sugiyama,
Microbiol. Rev. 44:419 (1980) and C.L. Hatheway, Clin. Microbiol. Rev. 3:66 (1990)]. The
heavy chain of the active toxin is cleaved by trypsin to produce two fragments termed H_C
(also referred to as H_1 or C) and H_N (also referred to as H_2 or B). The H_C fragment (~46 kD)
comprises the carboxy end of the H chain. The H_N fragment (~49 kD) comprises the amino
end and remains attached to the L chain (H_N L). Neither H_C nor H_N L is toxic. H_C competes
with whole derivative toxin for binding to synaptosomes and therefore H_C is said to contain
the receptor binding site. The H_C and H_N fragments of botulinal toxin are analogous to the
fragments C and B of tetanus toxin which are produced by papain cleavage. The C fragment
of tetanus toxin has been shown to be responsible for the binding of tetanus toxin to purified
gangliosides and neuronal cells [Halpern and Loftus, J. Biol. Chem. 268:11188 (1993)].

Antisera raised against purified preparations of isolated botulinal H and L chains have
been shown to protect mice against the lethal effects of the toxin; however, the effectiveness
of the two antisera differ with the anti-H sera being more potent (H. Sugiyama, supra).
While the different botulinal toxins show structural similarity to one another, the different
serotypes are reported to be immunologically distinct (i.e., sera raised against one toxin type
does not cross-react to a significant degree with other types). Thus, the generation of
multivalent vaccines may require the use of more than one type of toxin.

C. botulinum toxin genes from all seven serotypes have been cloned and sequenced
(Minton (1995), supra); in addition, partial amino acid sequence is available for a number of
C. botulinum toxins isolated from different strains within a given serotype. The C. botulinum
toxins contain about 1250-1300 amino acid residues. On the DNA level, the overall degree of
homology between C. botulinum serotypes A, B, C, D and E toxins averages between 50 and
60% identity with a greater degree of homology being found between H chain-encoding
regions than between those encoding L chains [Whelan et al. (1992) Appl. Environ.
Microbiol. 58:2345]. The degree of identity between C. botulinum toxins on the amino acid
level reflects the level of DNA sequence homology. The most divergent area of DNA and
amino acid sequence is found within the carboxy-terminal area of the various C. botulinum H
chain genes. This portion of the toxin (i.e., H_C or the C fragment) plays a major role in cell
binding. As toxin from different serotypes is thought to bind to distinct cell receptor
molecules, it is not surprising that the toxins diverge significantly over this region.
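
The 50 to 60% figure above is a percent identity over aligned sequences. As a minimal sketch of how such a figure can be computed, the following assumes two sequences that have already been aligned to equal length, with "-" marking gaps. The function name and the toy sequences are hypothetical illustrations and are not taken from any toxin gene; the cited reference may have used a different definition of identity.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped alignment columns in which both sequences match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy, made-up aligned fragments (assumption: not real toxin sequences).
toy_a = "ATGCCATTTG-TAAT"
toy_b = "ATGCAATTAGATAAT"
identity = percent_identity(toy_a, toy_b)
print(f"{identity:.1f}% identity over ungapped columns")
```

Excluding gapped columns from the denominator is one common convention; counting them instead would lower the reported identity, so the convention should be stated alongside any figure.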

Within a given serotype, small variations in the primary amino acid sequence of the
botulinal toxins isolated from different strains has been reported [Whelan et al. (1992), supra
and Minton (1995), supra]. The present invention contemplates fusion proteins comprising
portions of C. botulinum toxins from serotypes A-G including the variants found among
different strains within a given serotype. The present invention provides oligonucleotide
primers which may be used to amplify the C fragment or receptor-binding region of the toxin
gene from various strains of C. botulinum serotype A, serotype B, serotype C (C1 and C2),
serotype D, serotype E, serotype F and serotype G. A large number of different strains of C.
botulinum serotype A, serotype B, serotype C, serotype D, serotype E and serotype F are
available from the American Type Culture Collection (ATCC; Rockville, MD). For example,
the ATCC provides the following: Type A strains: 174 (ATCC 3502), 457 (ATCC 17862),
and NCTC 7272 (ATCC 19397); Type B strains: 34 (ATCC 439), 62A (ATCC 7948), NCA
213 B (ATCC 7949), 13114 (ATCC 8083), 3137 (ATCC 17780), 1347 (ATCC 17841), 2017
(ATCC 17843), 2217 (ATCC 17844), 2254 (ATCC 17845) and VPI 1731 (ATCC 25765);
Type C strains: 2220 (ATCC 17782), 2239 (ATCC 17783), 2223 (ATCC 17784; a type C-β
strain; C-β strains produce C2 toxin), 662 (ATCC 17849; a type C-α strain; C-α strains
produce mainly C1 toxin and a small amount of C2 toxin), 2021 (ATCC 17850; a type C-α
strain) and VPI 3803 (ATCC 25766); Type D strains: ATCC 9633, 2023 (ATCC 17851), and
VPI 5995 (ATCC 27517); Type E strains: ATCC 43181, 36208 (ATCC 9564), 2231 (ATCC
17786), 2229 (ATCC 17852), 2279 (ATCC 17854) and 2285 (ATCC 17855) and Type F
strains: 202F (ATCC 23387), VPI 4404 (ATCC 25764), VPI 2382 (ATCC 27321) and
Langeland (ATCC 35415). Type G strain, 113/30 (NCFB 3012) may be obtained from the
National Collection of Food Bacteria (NCFB, AFRC Institute of Food Research, Reading,
United Kingdom).

Purification methods have been reported for native toxin types A, B, C, D, E, and F
[reviewed in G. Sakaguchi, Pharmac. Ther. 19:165 (1983)]. As the different botulinal toxins
are structurally related, the invention contemplates the expression of any of the botulinal
toxins (e.g., types A-G) as soluble recombinant fusion proteins.

In particular, methods for purification of the type A botulinum neurotoxin have been
developed [L.J. Moberg and H. Sugiyama, Appl. Environ. Microbiol. 35:878 (1978)].
Immunization of hens with detoxified purified protein results in the generation of neutralizing
antibodies [B.S. Thalley et al., in Botulinum and Tetanus Neurotoxins, B.R. DasGupta, ed.,
Plenum Press, New York (1993), p. 467].

The currently available C. botulinum pentavalent vaccine comprising chemically
inactivated (i.e., formaldehyde treated) type A, B, C, D and E toxins is not adequate. The
efficacy is variable (in particular, only 78% of recipients produce protective levels of anti-type
B antibodies following administration of the primary series) and immunization is painful
(deep subcutaneous inoculation is required for administration), with adverse reactions being
common (moderate to severe local reactions occur in approximately 6% of recipients upon
initial injection; this number rises to approximately 11% of individuals who receive booster
injections) [Informational Brochure for the Pentavalent (ABCDE) Botulinum Toxoid, Centers
for Disease Control]. Preparation of this vaccine is dangerous as active toxin must be handled
by laboratory workers.

In general, chemical detoxification of bacterial toxins using agents such as
formaldehyde, glutaraldehyde or hydrogen peroxide is not optimal for the generation of
vaccines or antitoxins. A delicate balance must be struck between too much and too little
chemical modification. If the treatment is insufficient, the vaccine may retain residual
toxicity. If the treatment is excessive, the vaccine may lose potency due to destruction of
native immunogenic determinants. Another major limitation of using botulinal toxoids for the
generation of antitoxins or vaccines is the high production expense. For the above reasons,
the development of methods for the production of nontoxic but immunogenic C. botulinum
toxin proteins is desirable.

The C. botulinum and C. tetanus toxin proteins have similar structures [reviewed in
E.J. Schantz and E.A. Johnson, Microbiol. Rev. 56:80 (1992)]. The carboxy-terminal 50 kD
fragment of the tetanus toxin heavy chain (fragment C) is released by papain cleavage and has
been shown to be non-toxic and immunogenic. Recombinant tetanus toxin fragment C has
been developed as a candidate vaccine antigen [A.J. Makoff et al., Bio/Technology 7:1043
(1989)]. Mice immunized with recombinant tetanus toxin fragment C were protected from
challenge with lethal doses of tetanus toxin. No studies have demonstrated that the
recombinant tetanus fragment C protein confers immunity against other botulinal toxins such
as the C. botulinum toxins.

Recombinant tetanus fragment C has been expressed in E. coli [A.J. Makoff et al.,
Bio/Technology, supra and Nucleic Acids Res. 17:10191 (1989); J.L. Halpern et al., Infect.
Immun. 58:1004 (1990)], yeast [M.A. Romanos et al., Nucleic Acids Res. 19:1461 (1991)]
and baculovirus [I.G. Charles et al., Infect. Immun. 59:1627 (1991)]. Synthetic tetanus toxin
genes had to be constructed to facilitate expression in yeast [M.A. Romanos et al., supra] and
E. coli [A.J. Makoff et al., Nucleic Acids Res., supra], due to the high A-T content of the
tetanus toxin gene sequences. High A-T content is a common feature of clostridial genes
[M.R. Popoff et al., Infect. Immun. 59:3673 (1991); H.F. LaPenotiere et al., in Botulinum and
Tetanus Neurotoxins, B.R. DasGupta, ed., Plenum Press, New York (1993), p. 463] which
creates expression difficulties in E. coli and yeast due primarily to altered codon usage
frequency and fortuitous polyadenylation sites, respectively.
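
The A-T content referred to above is simply the fraction of A and T bases in a sequence. The following is a minimal sketch of that calculation; the function name and the short toy sequence are hypothetical and do not correspond to any clostridial gene.

```python
def at_content(dna: str) -> float:
    """Fraction of A and T among the unambiguous bases (A, C, G, T)."""
    counted = [base for base in dna.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    at_bases = sum(1 for base in counted if base in "AT")
    return at_bases / len(counted)


# Toy, made-up coding fragment (assumption: for illustration only).
toy_gene = "ATGAAATTTATTAATGGCAAAGATTAA"
fraction = at_content(toy_gene)
print(f"A-T content: {100 * fraction:.1f}%")
```

Ambiguity codes such as N are ignored rather than counted, which keeps the fraction comparable between sequences of differing quality.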

The C fragment of the C. botulinum type A neurotoxin heavy chain has been evaluated
as a vaccine candidate. The C. botulinum type A neurotoxin gene has been cloned and
sequenced [D.E. Thompson et al., Eur. J. Biochem. 189:73 (1990)]. The C fragment of the
type A toxin was expressed as either a fusion protein comprising the botulinal C fragment
fused with the maltose binding protein (MBP) or as a native protein [H.F. LaPenotiere et al.,
(1993) supra, H.F. LaPenotiere et al., Toxicon. 33:1383 (1995) and Middlebrook and Brown
(1995), Curr. Top. Microbiol. Immunol. 195:89-122]. The plasmid construct encoding the
native protein was reported to be unstable, while the fusion protein was expressed primarily in
inclusion bodies as insoluble protein. Immunization of mice with crudely purified MBP
fusion protein resulted in protection against IP challenge with 3 LD50 doses of toxin
[LaPenotiere et al., (1993) and (1995), supra]. However, this recombinant C. botulinum type
A toxin C fragment/MBP fusion protein is not a suitable immunogen for the production of
vaccines as it is expressed as an insoluble protein in E. coli. Furthermore, this recombinant
C. botulinum type A toxin C fragment/MBP fusion protein was not shown to be substantially
free of endotoxin contamination. Experience with recombinant C. botulinum type A toxin C
fragment/MBP fusion proteins shows that the presence of the MBP on the fusion protein
greatly complicates the removal of endotoxin from preparations of the recombinant fusion
protein (see Ex. 24, infra). Expression of a synthetic gene encoding C. botulinum type A
toxin C fragment as a soluble protein excreted from insect cells has been reported
[Middlebrook and Brown (1995), supra]; no details regarding the level of expression achieved
or the presence of endotoxin or other pyrogens were provided. Like the insoluble protein
expressed in E. coli, immunization with the recombinant protein produced in insect cells was
reported to protect mice from challenge with C. botulinum toxin A.

Inclusion body protein must be solubilized prior to purification and/or administration
to a host. The harsh treatment of inclusion body protein needed to accomplish this
solubilization may reduce the immunogenicity of the purified protein. Ideally, recombinant
proteins to be used as vaccines are expressed as soluble proteins at high levels (i.e., greater
than or equal to about 0.75% of total cellular protein) in E. coli or other host cells (e.g.,
yeast, insect cells, etc.). This facilitates the production and isolation of sufficient quantities of
the immunogen in a highly purified form (i.e., substantially free of endotoxin or other
pyrogen contamination). The ability to express recombinant toxin proteins as soluble proteins
in E. coli is advantageous due to the low cost of growth compared to insect or mammalian
tissue culture cells.
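
The "high level" criterion above (about 0.75% of total cellular protein) is a simple ratio. As a rough sketch, the check can be written as follows; the function names and the example quantities are hypothetical, and only the 0.75% threshold comes from the text.

```python
# Threshold from the text: "greater than or equal to about 0.75% of total cellular protein".
HIGH_LEVEL_THRESHOLD_PCT = 0.75


def percent_of_total_protein(recombinant_mg: float, total_mg: float) -> float:
    """Soluble recombinant protein as a percentage of total cellular protein."""
    if total_mg <= 0:
        raise ValueError("total protein must be positive")
    if recombinant_mg < 0 or recombinant_mg > total_mg:
        raise ValueError("recombinant protein must lie between 0 and total protein")
    return 100.0 * recombinant_mg / total_mg


def is_high_level_expression(recombinant_mg: float, total_mg: float,
                             threshold_pct: float = HIGH_LEVEL_THRESHOLD_PCT) -> bool:
    """True when expression meets or exceeds the stated threshold."""
    return percent_of_total_protein(recombinant_mg, total_mg) >= threshold_pct


# Hypothetical measurements in mg (assumption: illustrative values only).
print(is_high_level_expression(1.2, 100.0))  # 1.2% of total protein
print(is_high_level_expression(0.5, 100.0))  # 0.5% of total protein
```

Because the text says "about 0.75%", the threshold is exposed as a parameter rather than treated as an exact cutoff.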

The C. botulinum type B neurotoxin gene has been cloned and sequenced from two
strains of C. botulinum type B [Whelan et al. (1992) Appl. Environ. Microbiol. 58:2345
(Danish strain) and Hutson et al. (1994) Curr. Microbiol. 28:101 (Eklund 17B strain)]. The
nucleotide sequence of the toxin gene derived from the Eklund 17B strain (ATCC 25765) is
available from the EMBL/GenBank sequence data banks under the accession number X71343;
the nucleotide sequence of the coding region is listed in SEQ ID NO:39. The amino acid
sequence of the C. botulinum type B neurotoxin derived from the strain Eklund 17B is listed
in SEQ ID NO:40. The nucleotide sequence of the C. botulinum serotype B toxin gene
derived from the Danish strain is listed in SEQ ID NO:41. The amino acid sequence of the
C. botulinum type B neurotoxin derived from the Danish strain is listed in SEQ ID NO:42.

The C. botulinum type B neurotoxin gene is synthesized as a single polypeptide chain
which is processed to form a dimer composed of a light and a heavy chain linked via
disulfide bonds. The light chain is responsible for pharmacological activity (i.e., inhibition of
the release of acetylcholine at the neuromuscular junction). The N-terminal portion of the
heavy chain is thought to mediate channel formation while the C-terminal portion mediates
toxin binding; the type B neurotoxin has been reported to exist as a mixture of predominantly
single chain with some double chain (Whelan et al., supra). The 50 kD carboxy-terminal
portion of the heavy chain is referred to as the C fragment or the H_C domain. The present
invention reports, for the first time, the expression of the C fragment of C. botulinum type B
toxin in heterologous hosts (e.g., E. coli).

The C. botulinum type E neurotoxin gene has been cloned and sequenced from a
number of different strains [Poulet et al. (1992) Biochem. Biophys. Res. Commun. 183:107;
Whelan et al. (1992) Eur. J. Biochem. 204:657; and Fujii et al. (1993) J. Gen. Microbiol.
139:79]. The nucleotide sequence of the type E toxin gene is available from the EMBL
sequence data bank under accession numbers X62089 (strain Beluga) and X62683 (strain
NCTC 11219); the nucleotide sequence of the coding region (strain Beluga) is listed in SEQ
ID NO:45. The amino acid sequence of the C. botulinum type E neurotoxin derived from
strain Beluga is listed in SEQ ID NO:46. The type E neurotoxin gene is synthesized as a
single polypeptide chain which may be converted to a double-chain form (i.e., a heavy chain
and a light chain) by cleavage with trypsin; unlike the type A neurotoxin, the type E
neurotoxin exists essentially only in the single-chain form. The 50 kD carboxy-terminal
portion of the heavy chain is referred to as the C fragment or the H_C domain. The present
invention reports, for the first time, the expression of the C fragment of C. botulinum type E
toxin in heterologous hosts (e.g., E. coli).

The C. botulinum type C1, D, F and G neurotoxin genes have been cloned and
sequenced. The nucleotide and amino acid sequences of these genes and toxins are provided
herein. The invention provides methods for the expression of the C fragment from each of
these toxin genes in heterologous hosts and the purification of the resulting recombinant
proteins.

The subject invention provides methods which allow the production of soluble C.
botulinum toxin proteins in economical host cells (e.g., E. coli). In addition, the subject
invention provides methods which allow the production of soluble botulinal toxin proteins in
yeast and insect cells. Further, methods for the isolation of purified soluble C. botulinum
toxin proteins which are suitable for immunization of humans and other animals are provided.
These soluble, purified preparations of C. botulinum toxin proteins provide the basis for
improved vaccine preparations and facilitate the production of antitoxin.

When recombinant clostridial toxin proteins produced in gram-negative bacteria (e.g.,
E. coli) are used as vaccines, they are purified to remove endotoxin prior to administration to
a host animal. In order to vaccinate a host, an immunogenically-effective amount of purified
substantially endotoxin-free recombinant clostridial toxin protein is administered in any of a
number of physiologically acceptable carriers known to the art. When administered for the
purpose of vaccination, the purified substantially endotoxin-free recombinant clostridial toxin
protein may be used alone or in conjunction with known adjuvants, including potassium alum,
aluminum phosphate, aluminum hydroxide, Gerbu adjuvant (GMDP; C.C. Biotech Corp.),
RIBI adjuvant (MPL; RIBI Immunochemical Research, Inc.), and QS21 (Cambridge Biotech).
The alum and aluminum-based adjuvants are particularly preferred when vaccines are to be
administered to humans; however, any adjuvant approved for use in humans may be
employed. The route of immunization may be nasal, oral, intramuscular, intraperitoneal or
subcutaneous.

The invention contemplates the use of soluble, substantially endotoxin-free
preparations of fusion proteins comprising the C fragment of the C. botulinum type A, B, C,
D, E, F, and G toxin as vaccines. In one embodiment, the vaccine comprises the C fragment
of either the C. botulinum type A, B, C, D, E, F, or G toxin and a poly-histidine tract (also
called a histidine tag). In a particularly preferred embodiment, a fusion protein comprising
the histidine-tagged C fragment is expressed using the pET series of expression vectors
(Novagen). The pET expression system utilizes a vector in which the sequence encoding the
fusion protein is placed under the control of the T7 promoter, and a host cell which can be
induced to express the T7 RNA polymerase (i.e., a DE3 host strain). The production of C
fragment fusion proteins containing a histidine tract is not limited to the use of a particular
expression vector and host strain.
Several commercially available expression vectors and host strains can be used to express the
C fragment protein sequences as a fusion protein containing a histidine tract. (For example, the
pQE series (pQE-8, 12, 16, 17, 18, 30, 31, 32, 40, 41, 42, 50, 51, 52, 60 and 70) of
expression vectors (Qiagen), which are used with the host strains M15[pREP4] (Qiagen) and
SG13009[pREP4] (Qiagen), can be used to express fusion proteins containing six histidine
residues at the amino-terminus of the fusion protein.) Furthermore, a number of commercially
available expression vectors which provide a histidine tract also provide a protease cleavage
site between the histidine tract and the protein of interest (e.g., botulinal toxin sequences).
Cleavage of the resulting fusion protein with the appropriate protease will remove the
histidine tag from the protein of interest (e.g., botulinal toxin sequences) (see Example 28a,
infra). Removal of the histidine tag may be desirable prior to administration of the
recombinant botulinal toxin protein to a subject (e.g., a human).

VI. Detection Of Toxin

The invention contemplates detecting bacterial toxin in a sample. The term "sample"
in the present specification and claims is used in its broadest sense. On the one hand it is
meant to include a specimen or culture. On the other hand, it is meant to include both
biological and environmental samples.

Biological samples may be animal, including human, fluid, solid (e.g., stool) or tissue;
liquid and solid food products and ingredients such as dairy items, vegetables, meat and meat
by-products, and waste. Environmental samples include environmental material such as
surface matter, soil, water and industrial samples, as well as samples obtained from food and
dairy processing instruments, apparatus, equipment, disposable and non-disposable items.
These examples are not to be construed as limiting the sample types applicable to the present
invention.

The invention contemplates detecting bacterial toxin by a competitive immunoassay
method that utilizes recombinant toxin A and toxin B proteins and antibodies raised against
recombinant bacterial toxin proteins. A fixed amount of the recombinant toxin protein is
immobilized to a solid support (e.g., a microtiter plate). A biological sample suspected of
containing a bacterial toxin is first mixed with affinity-purified or PEG-fractionated antibodies
directed against the recombinant toxin protein, and the mixture is then added to the
immobilized toxin protein. A reporter reagent is then added which is capable of detecting the
presence of antibody bound to the immobilized toxin protein. The reporter substance may
comprise an antibody with binding specificity for the antitoxin attached to a molecule which is
used to identify the presence of the reporter substance. If toxin is present in the sample, this
toxin will compete with the immobilized recombinant toxin protein for binding to the
anti-recombinant antibody, thereby reducing the signal obtained following the addition of the
reporter reagent. A control is employed where the antibody is not mixed with the sample.
This gives the highest (or reference) signal.

The invention also contemplates detecting bacterial toxin by a "sandwich"
immunoassay method that utilizes antibodies directed against recombinant bacterial toxin
proteins. Affinity-purified antibodies directed against recombinant bacterial toxin proteins are
immobilized to a solid support (e.g., microtiter plates). Biological samples suspected of
containing bacterial toxins are then added, followed by a washing step to remove substantially
all unbound antitoxin. The biological sample is next exposed to the reporter substance, which
binds to antitoxin and is then washed free of substantially all unbound reporter substance.
The reporter substance may comprise an antibody with binding specificity for the antitoxin
attached to a molecule which is used to identify the presence of the reporter substance.
Identification of the reporter substance in the biological tissue indicates the presence of the
bacterial toxin.

It is also contemplated that bacterial toxin be detected by pouring liquids (e.g., soups
and other fluid foods and feeds including nutritional supplements for humans and other
animals) over immobilized antibody which is directed against the bacterial toxin. It is
contemplated that the immobilized antibody will be present in or on such supports as
cartridges, columns, beads, or any other solid support medium. In one embodiment, following
the exposure of the liquid to the immobilized antibody, unbound toxin is substantially
removed by washing. The support is then exposed to a reporter substance
which detects the presence of bound toxin. In a preferred embodiment the reporter substance
is an enzyme, fluorescent dye, or radioactive compound attached to an antibody which is
directed against the toxin (i.e., in a "sandwich" immunoassay). It is also contemplated that
the detection system will be developed as necessary (e.g., the addition of enzyme substrate in
enzyme systems; observation using fluorescent light for fluorescent dye systems; and
quantitation of radioactivity for radioactive systems).
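
The readout of the competitive immunoassay described above can be illustrated with a short
calculation. The following is a minimal sketch only, assuming that the signal from each well is
a single number (e.g., an absorbance) and that the control well in which antibody is not mixed
with sample supplies the reference signal. The function names and the 50% cutoff are
hypothetical and are not taken from the specification.

```python
def percent_inhibition(sample_signal: float, reference_signal: float) -> float:
    """Signal reduction of a sample well relative to the no-sample control well.

    reference_signal: reading from the control in which the antibody was not
                      mixed with sample (the highest, or reference, signal).
    sample_signal:    reading from a well in which the antibody was first
                      mixed with the test sample.
    """
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * (reference_signal - sample_signal) / reference_signal


def toxin_suspected(sample_signal: float, reference_signal: float,
                    cutoff_percent: float = 50.0) -> bool:
    # cutoff_percent is a hypothetical decision threshold chosen for illustration.
    return percent_inhibition(sample_signal, reference_signal) >= cutoff_percent
```

Free toxin in the sample occupies the antibody before it reaches the immobilized toxin, so a
larger percent inhibition corresponds to more toxin in the sample.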

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects
of the present invention and are not to be construed as limiting the scope thereof.

In the disclosure which follows, the following abbreviations apply: °C (degrees
Centigrade); rpm (revolutions per minute); BBS-Tween (borate buffered saline containing
Tween); BSA (bovine serum albumin); ELISA (enzyme-linked immunosorbent assay); CFA
(complete Freund's adjuvant); IFA (incomplete Freund's adjuvant); IgG (immunoglobulin G);
IgY (immunoglobulin Y); IM (intramuscular); IP (intraperitoneal); IV (intravenous or
intravascular); SC (subcutaneous); H2O (water); HCl (hydrochloric acid); LD100 (lethal dose
for 100% of experimental animals); aa (amino acid); HPLC (high performance liquid
chromatography); kD (kilodaltons); gm (grams); µg (micrograms); mg (milligrams); ng
(nanograms); µl (microliters); ml (milliliters); mm (millimeters); nm (nanometers); µm
(micrometer); M (molar); mM (millimolar); MW (molecular weight); sec (seconds); min(s)
(minute/minutes); hr(s) (hour/hours); MgCl2 (magnesium chloride); NaCl (sodium chloride);
Na2CO3 (sodium carbonate); OD280 (optical density at 280 nm); OD600 (optical density at 600
nm); PAGE (polyacrylamide gel electrophoresis); PBS [phosphate buffered saline (150 mM
NaCl, 10 mM sodium phosphate buffer, pH 7.2)]; PEG (polyethylene glycol); PMSF
(phenylmethylsulfonyl fluoride); SDS (sodium dodecyl sulfate); Tris
(tris(hydroxymethyl)aminomethane); Ensure® (Ensure®, Ross Laboratories, Columbus, OH);
Enfamil® (Enfamil®, Mead Johnson); w/v (weight to volume); v/v (volume to volume);
Amicon (Amicon, Inc., Beverly, MA); Amresco (Amresco, Inc., Solon, OH); ATCC
(American Type Culture Collection, Rockville, MD); BBL (Baltimore Biologics Laboratory,
(a division of Becton Dickinson), Cockeysville, MD); Becton Dickinson (Becton Dickinson
Labware, Lincoln Park, NJ); BioRad (BioRad, Richmond, CA); Biotech (C-C Biotech Corp.,
Poway, CA); Charles River (Charles River Laboratories, Wilmington, MA); Cocalico
(Cocalico Biologicals Inc., Reamstown, PA); CytRx (CytRx Corp., Norcross, GA); Falcon
(e.g., Baxter Healthcare Corp., McGaw Park, IL and Becton Dickinson); FDA (Federal Food
and Drug Administration); Fisher Biotech (Fisher Biotech, Springfield, NJ); GIBCO (Grand
Island Biologic Company/BRL, Grand Island, NY); Gibco-BRL (Life Technologies, Inc.,
Gaithersburg, MD); Harlan Sprague Dawley (Harlan Sprague Dawley, Inc., Madison, WI);
Mallinckrodt (a division of Baxter Healthcare Corp., McGaw Park, IL); Millipore (Millipore
Corp., Marlborough, MA); New England Biolabs (New England Biolabs, Inc., Beverly, MA);
Novagen (Novagen, Inc., Madison, WI); Pharmacia (Pharmacia, Inc., Piscataway, NJ); Qiagen
(Qiagen, Chatsworth, CA); Sasco (Sasco, Omaha, NE); Showdex (Showa Denko America,
Inc., New York, NY); Sigma (Sigma Chemical Co., St. Louis, MO); Sterogene (Sterogene,
Inc., Arcadia, CA); Tech Lab (Tech Lab, Inc., Blacksburg, VA); and Vaxcell (Vaxcell, Inc.,
a subsidiary of CytRX Corp., Norcross, GA).

When a recombinant protein is described in the specification, it is referred to in a
short-hand manner by the amino acids in the toxin sequence present in the recombinant
protein rounded to the nearest 10. For example, the recombinant protein pMB1850-2360
contains amino acids 1852 through 2362 of the C. difficile toxin B protein. The specification
gives detailed construction details for all recombinant proteins such that one skilled in the art
will know precisely which amino acids are present in a given recombinant protein.
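
The short-hand naming convention described above can be sketched in a few lines. This is an
illustration only: the helper names are hypothetical, and the treatment of a position ending in
5 (rounded up here) is an assumption, since the text does not address that case.

```python
def round_to_nearest_10(position: int) -> int:
    # Integer arithmetic avoids floating point; a position ending in 5 is
    # rounded up, which is an assumption made for this sketch.
    return (position + 5) // 10 * 10


def shorthand_name(prefix: str, first_aa: int, last_aa: int) -> str:
    """Build a short-hand name from the first and last toxin residues present."""
    return f"{prefix}{round_to_nearest_10(first_aa)}-{round_to_nearest_10(last_aa)}"
```

Applied to the example in the text, `shorthand_name("pMB", 1852, 2362)` gives the name of the
protein containing amino acids 1852 through 2362 of toxin B.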

EXAMPLE 1

Production Of High-Titer Antibodies To Clostridium difficile Organisms In A Hen

Antibodies to certain pathogenic organisms have been shown to be effective in treating
diseases caused by those organisms. It has not been shown whether antibodies can be raised
against Clostridium difficile which would be effective in treating infection by this organism.
Accordingly, C. difficile was tested as immunogen for production of hen antibodies.

To determine the best course for raising high-titer egg antibodies against whole C.
difficile organisms, different immunizing strains and different immunizing concentrations were
examined. The example involved (a) preparation of the bacterial immunogen,
(b) immunization, (c) purification of anti-bacterial chicken antibodies, and (d) detection of
anti-bacterial antibodies in the purified IgY preparations.



a) Preparation Of Bacterial Immunogen

C. difficile strains 43594 (serogroup A) and 43596 (serogroup C) were originally
obtained from the ATCC. These two strains were selected because they represent two of the
most commonly-occurring serogroups isolated from patients with antibiotic-associated
pseudomembranous colitis. [Delmee et al., J. Clin. Microbiol., 28(10):2210 (1990).]
Additionally, both of these strains have been previously characterized with respect to their
virulence in the Syrian hamster model for C. difficile infection. [Delmee et al., J. Med.
Microbiol., 33:85 (1990).]

The bacterial strains were separately cultured on brain heart infusion agar for 48 hours
at 37°C in a Gas Pack 100 Jar (BBL, Cockeysville, MD) equipped with a Gas Pack Plus
anaerobic envelope (BBL). Forty-eight hour cultures were used because they produce better
growth and the organisms have been found to be more cross-reactive with respect to their
surface antigen presentation. The greater the degree of cross-reactivity of our IgY
preparations, the better the probability of a broad range of activity against different
strains/serogroups. [Toma et al., J. Clin. Microbiol., 26(3):426 (1988).]

The resulting organisms were removed from the agar surface using a sterile dacron-tip
swab, and were suspended in a solution containing 0.4% formaldehyde in PBS, pH 7.2. This
concentration of formaldehyde has been reported as producing good results for the purpose of
preparing whole-organism immunogen suspensions for the generation of polyclonal anti-C.
difficile antisera in rabbits. [Delmee et al., J. Clin. Microbiol., 21:323 (1985); Davies et al.,
Microbial Path., 9:141 (1990).] In this manner, two separate bacterial suspensions were
prepared, one for each strain. The two suspensions were then incubated at 4°C for 1 hour.
Following this period of formalin-treatment, the suspensions were centrifuged at 4,200 x g for
20 min., and the resulting pellets were washed twice in normal saline. The washed pellets,
which contained formalin-treated whole organisms, were resuspended in fresh normal saline
such that the visual turbidity of each suspension corresponded to a #7 McFarland standard.
(M.A.C. Edelstein, "Processing Clinical Specimens for Anaerobic Bacteria: Isolation and
Identification Procedures," in S.M. Finegold et al. (eds.), Bailey and Scott's Diagnostic
Microbiology, pp. 477-507, C.V. Mosby Co., (1990). The preparation of McFarland
nephelometer standards and the corresponding approximate number of organisms for each
tube are described in detail at pp. 172-173 of this volume.) Each of the two #7 suspensions
was then split into two separate volumes. One volume of each suspension was volumetrically
adjusted, by the addition of saline, to correspond to the visual turbidity of a #1 McFarland
standard. [Id.] The #1 suspensions contained approximately 3 x 10^8 organisms/ml, and the
#7 suspensions contained approximately 2 x 10^9 organisms/ml. [Id.] The four resulting
concentration-adjusted suspensions of formalin-treated C. difficile organisms were considered
to be "bacterial immunogen suspensions." These suspensions were used immediately after
preparation for the initial immunization. [See section (b).]

The formalin-treatment procedure did not result in 100% non-viable bacteria in the
immunogen suspensions. In order to increase the level of killing, the formalin concentration
and length of treatment were both increased for subsequent immunogen preparations, as
described below in Table 4. (Although viability was decreased with the stronger formalin
treatment, 100% inviability of the bacterial immunogen suspensions was not reached.) Also,
in subsequent immunogen preparations, the formalin solutions were prepared in normal saline
instead of PBS. At day 49, the day of the fifth immunization, the excess volumes of the four
previous bacterial immunogen suspensions were stored frozen at -70°C for use during all
subsequent immunizations.

b) Immunization

For the initial immunization, 1.0 ml volumes of each of the four bacterial immunogen
suspensions described above were separately emulsified in 1.2 ml volumes of CFA (GIBCO).
For each of the four emulsified immunogen suspensions, two four-month old White Leghorn
hens (pre-laying) were immunized. (It is not necessary to use pre-laying hens; actively-laying
hens can also be utilized.) Each hen received a total volume of approximately 1.0 ml of a
single emulsified immunogen suspension via four injections (two subcutaneous and two
intramuscular) of approximately 250 µl per site. In this manner, a total of four different
immunization combinations, using two hens per combination, were initiated for the purpose of
evaluating both the effect of immunizing concentration on egg yolk antibody (IgY)
production, and interstrain cross-reactivity of IgY raised against heterologous strains. The
four immunization groups are summarized in Table 3.

TABLE 3

Immunization Groups

Group Designation   Immunizing Strain            Approximate Immunizing Dose
CD 43594, #1        C. difficile strain 43594    1.5 x 10^8 organisms/hen
CD 43594, #7        C. difficile strain 43594    1.0 x 10^9 organisms/hen
CD 43596, #1        C. difficile strain 43596    1.5 x 10^8 organisms/hen
CD 43596, #7        C. difficile strain 43596    1.0 x 10^9 organisms/hen
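
The approximate doses listed in Table 3 follow from the dilution of each suspension in
adjuvant. The sketch below is illustrative only. It assumes suspension densities of about
3 x 10^8 and 2 x 10^9 organisms/ml for the #1 and #7 McFarland standards, assumes that the
suspension and adjuvant volumes are additive, and uses a hypothetical function name.

```python
def organisms_per_hen(suspension_per_ml: float,
                      suspension_ml: float = 1.0,
                      adjuvant_ml: float = 1.2,
                      injected_ml: float = 1.0) -> float:
    """Estimate the number of organisms delivered to one hen.

    Assumes the emulsion volume is the simple sum of the suspension and
    adjuvant volumes, which is an approximation made for this sketch.
    """
    emulsion_ml = suspension_ml + adjuvant_ml
    organisms_per_ml_emulsion = suspension_per_ml * suspension_ml / emulsion_ml
    return organisms_per_ml_emulsion * injected_ml


low_dose = organisms_per_hen(3e8)   # #1 McFarland suspension, about 1.4e8
high_dose = organisms_per_hen(2e9)  # #7 McFarland suspension, about 9.1e8
```

Both estimates fall close to the rounded values given in the table; the small differences
reflect the approximate nature of the McFarland densities and of the injected volume.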



The time point for the first series of immunizations was designated as "day zero." All
subsequent immunizations were performed as described above except that the bacterial
immunogen suspensions were emulsified using IFA (GIBCO) instead of CFA, and for the
later time point immunizations, the stored frozen suspensions were used instead of freshly-
prepared suspensions. The immunization schedule used is listed in Table 4.

TABLE 4

Immunization Schedule

Day Of Immunization   Formalin-Treatment   Immunogen Preparation Used
0                     1%, 1 hr.            freshly-prepared
14                    1%, overnight        freshly-prepared
21                    1%, overnight        freshly-prepared
35                    1%, 48 hrs.          freshly-prepared
49                    1%, 72 hrs.          freshly-prepared
70                                         stored frozen
85                                         stored frozen
105                                        stored frozen







c) Purification Of Anti-Bacterial Chicken Antibodies

Groups of four eggs were collected per immunization group between days 80 and 84
post-initial immunization, and chicken immunoglobulin (IgY) was extracted according to a
modification of the procedure of A. Polson et al., Immunol. Comm., 9:495 (1980). A gentle
stream of distilled water from a squirt bottle was used to separate the yolks from the whites,
and the yolks were broken by dropping them through a funnel into a graduated cylinder. The
four individual yolks were pooled for each group. The pooled, broken yolks were blended
with 4 volumes of egg extraction buffer to improve antibody yield (egg extraction buffer is
0.01 M sodium phosphate, 0.1 M NaCl, pH 7.5, containing 0.005% thimerosal), and PEG
8000 (Amresco) was added to a concentration of 3.5%. When all the PEG dissolved, the
protein precipitates that formed were pelleted by centrifugation at 13,000 x g for 10 minutes.
The supernatants were decanted and filtered through cheesecloth to remove the lipid layer,
and the PEG was added to the supernatants to a final concentration of 12% (the supernatants
were assumed to contain 3.5% PEG). After a second centrifugation, the supernatants were
discarded and the pellets were centrifuged a final time to extrude the remaining PEG. These
crude IgY pellets were then dissolved in the original yolk volume of egg extraction buffer and
stored at 4°C. As an additional control, a preimmune IgY solution was prepared as described
above, using eggs collected from unimmunized hens.
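
The second PEG addition described above raises the supernatant from an assumed 3.5% to a
final 12%. The amount to add can be sketched as follows, under the assumptions that the
percentages are weight per volume and that the volume change on dissolving the PEG is
ignored; neither assumption is stated in the text, and the function name is hypothetical.

```python
def peg_grams_to_add(volume_ml: float,
                     current_percent: float = 3.5,
                     target_percent: float = 12.0) -> float:
    """Grams of PEG needed to raise a solution from current to target % (w/v).

    Treats 1% (w/v) as 1 gm per 100 ml and ignores the change in volume on
    dissolving the PEG; both are simplifying assumptions of this sketch.
    """
    if target_percent < current_percent:
        raise ValueError("target concentration is below the current concentration")
    return volume_ml * (target_percent - current_percent) / 100.0
```

For example, 100 ml of supernatant assumed to contain 3.5% PEG would require 8.5 gm of
additional PEG under these assumptions.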

d) Detection Of Anti-Bacterial Antibodies In The Purified IgY
Preparations

In order to evaluate the relative levels of specific anti-C. difficile activity in the IgY
preparations described above, a modified version of the whole-organism ELISA procedure of
N.V. Padhye et al., J. Clin. Microbiol. 29:99-103 (1990) was used. Frozen organisms of both
C. difficile strains described above were thawed and diluted to a concentration of
approximately 1 x 10^7 organisms/ml using PBS, pH 7.2. In this way, two separate coating
suspensions were prepared, one for each immunizing strain. Into the wells of 96-well
microtiter plates (Falcon, Pro-Bind Assay Plates) were placed 100 µl volumes of the coating
suspensions. In this manner, each plate well received a total of approximately 1 x 10^6
organisms of one strain or the other. The plates were then incubated at 4°C overnight. The
next morning, the coating suspensions were decanted, and all wells were washed three times
using PBS. In order to block non-specific binding sites, 100 µl of 0.5% BSA (Sigma) in PBS
was then added to each well, and the plates were incubated for 2 hours at room temperature.
The blocking solution was decanted, and 100 µl volumes of the IgY preparations described
above were initially diluted 1:500 with a solution of 0.1% BSA in PBS, and then serially
diluted in 1:5 steps. The following dilutions were placed in the wells: 1:500, 1:2,500,
1:12,500, 1:62,500, 1:312,500, and 1:1,562,500. The plates were again incubated for 2 hours
at room temperature. Following this incubation, the IgY-containing solutions were decanted,
and the wells were washed three times using BBS-Tween (0.1 M boric acid, 0.025 M sodium
borate, 1.0 M NaCl, 0.1% Tween-20), followed by two washes using PBS-Tween (0.1%
Tween-20), and finally, two washes using PBS only. To each well, 100 µl of a 1:750 dilution
of rabbit anti-chicken IgG (whole-molecule)-alkaline phosphatase conjugate (Sigma) (diluted
in 0.1% BSA in PBS) was added. The plates were again incubated for 2 hours at room
temperature. The conjugate solutions were decanted and the plates were washed as described
above, substituting 50 mM Na2CO3, pH 9.5 for the PBS in the final wash. The plates were
developed by the addition of 100 µl of a solution containing 1 mg/ml para-nitrophenyl
phosphate (Sigma) dissolved in 50 mM Na2CO3, 10 mM MgCl2, pH 9.5 to each well, and
incubating the plates at room temperature in the dark for 45 minutes. The absorbance of each
well was measured at 410 nm using a Dynatech MR 700 plate reader. In this manner, each of
the four IgY preparations described above was tested for reactivity against both of the
immunizing C. difficile strains; strain-specific, as well as cross-reactive activity was
determined.
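
The serial dilution scheme used in the ELISA (an initial 1:500 dilution followed by 1:5 steps)
can be written out explicitly. The following is a minimal sketch; the function name is
hypothetical, and each dilution is represented by its reciprocal (e.g., 500 for 1:500).

```python
def dilution_series(initial: int = 500, step: int = 5, count: int = 6) -> list:
    """Reciprocal dilutions for a serial dilution starting at 1:initial."""
    return [initial * step ** i for i in range(count)]


def format_dilution(reciprocal: int) -> str:
    # Render a reciprocal dilution in the "1:N" form used in the text.
    return f"1:{reciprocal:,}"
```

Six steps starting at 1:500 reproduce the six dilutions placed in the wells, ending at
1:1,562,500.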

Table 5 shows the results of the whole-organism ELISA. All four IgY preparations
demonstrated significant levels of activity, to a dilution of 1:62,500 or greater, against both of
the immunizing organism strains. Therefore, antibodies raised against one strain were highly
cross-reactive with the other strain, and vice versa. The immunizing concentration of
organisms did not have a significant effect on organism-specific IgY production, as both
concentrations produced approximately equivalent responses. Therefore, the lower
immunizing concentration of approximately 1.5 x 10^8 organisms/hen is the preferred
immunizing concentration of the two tested. The preimmune IgY preparation appeared to
possess relatively low levels of C. difficile-reactive activity to a dilution of 1:500, probably
due to prior exposure of the animals to environmental clostridia.

An initial whole-organism ELISA was performed using IgY preparations made from
single CD 43594, #1 and CD 43596, #1 eggs collected around day 50 (data not shown).
Specific titers were found to be 5 to 10-fold lower than those reported in Table 5. These
results demonstrate that it is possible to begin immunizing hens prior to the time that they
begin to lay eggs, and to obtain high titer specific IgY from the first eggs that are laid. In
other words, it is not necessary to wait for the hens to begin laying before the immunization
schedule is started.

TABLE 5

Results Of The Anti-C. difficile Whole-Organism ELISA

IgY Preparation   Dilution Of IgY Prep   43594-Coated Wells   43596-Coated Wells
CD 43594, #1      1:500                  1.746                1.801
                  1:2,500                1.092                1.670
                  1:12,500               0.202                0.812
                  1:62,500               0.136                0.179
                  1:312,500              [illegible]          0.080
                  1:1,562,500            0.002                0.020
CD 43594, #7      1:500                  1.780                1.771
                  1:2,500                1.025                1.078
                  1:12,500               0.185                0.382
                  1:62,500               0.052                0.132
                  1:312,500              0.022                0.043
                  1:1,562,500            0.005                0.024
CD 43596, #1      1:500                  1.526                1.790
                  1:2,500                0.832                1.477
                  1:12,500               0.247                0.452
                  1:62,500               0.050                0.242
                  1:312,500              0.010                0.067
                  1:1,562,500            0.000                0.036
CD 43596, #7      1:500                  1.702                1.505
                  1:2,500                0.706                0.866
                  1:12,500               0.250                0.282
                  1:62,500               0.030                0.078
                  1:312,500              0.002                0.017
                  1:1,562,500            0.000                0.010
Preimmune IgY     1:500                  0.142                0.3[illegible]
                  1:2,500                0.032                0.077
                  1:12,500               0.006                0.024
                  1:62,500               0.002                0.012
                  1:312,500              0.004                0.010
                  1:1,562,500            0.002                0.014
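
One way to summarize a dilution series such as those in Table 5 is an endpoint titer, the
highest dilution at which the reading still exceeds a chosen cutoff. The sketch below is
illustrative only: the specification does not define a cutoff, so the values used here are
hypothetical, as is the function name. The readings are those listed for the CD 43596, #7
preparation on 43594-coated wells.

```python
from typing import Dict, Optional


def endpoint_titer(readings: Dict[int, float], cutoff: float) -> Optional[int]:
    """Highest reciprocal dilution whose absorbance exceeds the cutoff.

    readings maps a reciprocal dilution (e.g., 500 for 1:500) to an absorbance.
    Returns None when no dilution exceeds the cutoff.
    """
    positive = [dilution for dilution, a410 in readings.items() if a410 > cutoff]
    return max(positive) if positive else None


cd43596_7_on_43594 = {
    500: 1.702, 2500: 0.706, 12500: 0.250,
    62500: 0.030, 312500: 0.002, 1562500: 0.000,
}
```

The result depends strongly on the cutoff: a hypothetical cutoff of 0.1 gives 1:12,500 for this
series, whereas a cutoff of 0.02 gives 1:62,500.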



EXAMPLE 2

Treatment Of C. difficile Infection With Anti-C. difficile Antibody

In order to determine whether the immune IgY antibodies raised against whole C.
difficile organisms were capable of inhibiting the infection of hamsters by C. difficile,
hamsters infected by these bacteria were utilized. [Lyerly et al., Infect. Immun., 59:2215-
2218 (1991).] This example involved: (a) determination of the lethal dose of C. difficile
organisms; and (b) treatment of infected animals with immune antibody or control antibody in
nutritional solution.

a) Determination Of The Lethal Dose Of C. difficile Organisms

Determination of the lethal dose of C. difficile organisms was carried out according to
the model described by D.M. Lyerly et al., Infect. Immun., 59:2215-2218 (1991). C. difficile
strain ATCC 43596 (serogroup C, ATCC) was plated on BHI agar and grown anaerobically
(BBL Gas Pak 100 system) at 37°C for 42 hours. Organisms were removed from the agar
surface using a sterile dacron-tip swab and suspended in sterile 0.9% NaCl solution to a
density of 10^8 organisms/ml.

In order to determine the lethal dose of C. difficile in the presence of control antibody
and nutritional formula, non-immune eggs were obtained from unimmunized hens and a 12%
PEG preparation made as described in Example 1(c). This preparation was redissolved in one
fourth the original yolk volume of vanilla flavor Ensure®.

Starting on day one, groups of female Golden Syrian hamsters (Harlan Sprague
Dawley), 8-9 weeks old and weighing approximately 100 gm, were orally administered 1 ml
of the preimmune/Ensure® formula at time zero, 2 hours, 6 hours, and 10 hours. At 1 hour,
animals were orally administered 3.0 mg clindamycin HCl (Sigma) in 1 ml of water. This
drug predisposes hamsters to C. difficile infection by altering the normal intestinal flora. On
day two, the animals were given 1 ml of the preimmune IgY/Ensure® formula at time zero, 2
hours, 6 hours, and 10 hours. At 1 hour on day two, different groups of animals were
inoculated orally with saline (control), or 10^2, 10^4, 10^6 or 10^8 C. difficile organisms in 1 ml
of saline. From days 3-12, animals were given 1 ml of the preimmune IgY/Ensure® formula
three times daily and observed for the onset of diarrhea and death. Each animal was housed
in an individual cage and was offered food and water ad libitum.

Administration of 10^6 - 10^8 organisms resulted in death in 3-4 days while the lower
doses of 10^2 - 10^4 organisms caused death in 5 days. Cecal swabs taken from dead animals
indicated the presence of C. difficile. Given the effectiveness of the 10^2 dose, this number of
organisms was chosen for the following experiment to see if hyperimmune anti-C. difficile
antibody could block infection.



b) Treatment Of Infected Animals With Immune Antibody Or
Control Antibody In Nutritional Formula

The experiment in (a) was repeated using three groups of seven hamsters each. Group
A received no clindamycin or C. difficile and was the survival control. Group B received
clindamycin, 10^2 C. difficile organisms and preimmune IgY on the same schedule as the
animals in (a) above. Group C received clindamycin, 10^2 C. difficile organisms, and
hyperimmune anti-C. difficile IgY on the same schedule as Group B. The anti-C. difficile IgY
was prepared as described in Example 1 except that the 12% PEG preparation was dissolved
in one fourth the original yolk volume of Ensure®.

All animals were observed for the onset of diarrhea or other disease symptoms and
death. Each animal was housed in an individual cage and was offered food and water ad
libitum. The results are shown in Table 6.



TABLE 6

The Effect Of Oral Feeding Of Hyperimmune IgY Antibody On C. difficile Infection

Animal Group                                    Time To Diarrhea(a)   Time To Death(a)
A   pre-immune IgY only                         no diarrhea           no deaths
B   Clindamycin, C. difficile, preimmune IgY    30 hrs.               49 hrs.
C   Clindamycin, C. difficile, immune IgY       33 hrs.               56 hrs.

(a) Mean of seven animals.

Hamsters in the control group A did not develop diarrhea and remained healthy during
the experimental period. Hamsters in groups B and C developed diarrheal disease. Anti-C.
difficile IgY did not protect the animals from diarrhea or death; all animals succumbed in the
same time interval as the animals treated with preimmune IgY. Thus, while immunization
with whole organisms apparently can improve sub-lethal symptoms with particular bacteria
(see U.S. Patent No. 5,080,895 to H. Tokoro), such an approach does not prove to be
productive to protect against the lethal effects of C. difficile.

EXAMPLE 3 

Production of C. botulinum Type A Antitoxin in Hens

In order to determine whether antibodies could be raised against the toxin produced by
clostridial pathogens, which would be effective in treating clostridial diseases, antitoxin to C.
botulinum type A toxin was produced. This example involves: (a) toxin modification; (b)
immunization; (c) antitoxin collection; (d) antigenicity assessment; and (e) assay of antitoxin
titer.

a) Toxin Modification

C. botulinum type A toxoid was obtained from B. R. DasGupta. From this, the active
type A neurotoxin (M.W. approximately 150 kD) was purified to greater than 99% purity,
according to published methods. [B.R. DasGupta & V. Sathyamoorthy, Toxicon, 22:415
(1984).] The neurotoxin was detoxified with formaldehyde according to published methods.
[B.R. Singh & B.R. DasGupta, Toxicon, 27:403 (1989).]



b) Immunization

C. botulinum toxoid for immunization was dissolved in PBS (1 mg/ml) and was
emulsified with an approximately equal volume of CFA (GIBCO) for initial immunization or
IFA for booster immunization. On day zero, two white leghorn hens, obtained from local
breeders, were each injected at multiple sites (intramuscular and subcutaneous) with 1 ml
inactivated toxoid emulsified in 1 ml CFA. Subsequent booster immunizations were made
according to the following schedule for day of injection and toxoid amount: days 14 and 21 -
0.5 mg; day 171 - 0.75 mg; days 394, 401, 409 - 0.25 mg. One hen received an additional
booster of 0.150 mg on day 544.



c) Antitoxin Collection

Total yolk immunoglobulin (IgY) was extracted as described in Example 1(c) and the
IgY pellet was dissolved in the original yolk volume of PBS with thimerosal.



d) Antigenicity Assessment

Eggs were collected from day 409 through day 423 to assess whether the toxoid was
sufficiently immunogenic to raise antibody. Eggs from the two hens were pooled and
antibody was collected as described in the standard PEG protocol. [Example 1(c).]

Antigenicity of the botulinal toxin was assessed on Western blots. The 150 kD detoxified
type A neurotoxin and unmodified, toxic, 300 kD botulinal type A complex (toxin used for
intragastric route administration for animal gut neutralization experiments; see Example 6)
were separated on a SDS-polyacrylamide reducing gel. The Western blot technique was
performed according to the method of Towbin. [H. Towbin et al., Proc. Natl. Acad. Sci.
USA, 76:4350 (1979).] Ten µg samples of C. botulinum complex and toxoid were dissolved
in SDS reducing sample buffer (1% SDS, 0.5% 2-mercaptoethanol, 50 mM Tris, pH 6.8, 10%
glycerol, 0.025% w/v bromphenol blue, 10% β-mercaptoethanol), heated at 95°C for 10 min
and separated on a 1 mm thick 5% SDS-polyacrylamide gel. [K. Weber and M.
Osborn, "Proteins and Sodium Dodecyl Sulfate: Molecular Weight Determination on
Polyacrylamide Gels and Related Procedures," in The Proteins, 3d Edition (H. Neurath &
R.L. Hill, eds), pp. 179-223, (Academic Press, NY, 1975).] Part of the gel was cut off and
the proteins were stained with Coomassie Blue. The proteins in the remainder of the gel were
transferred to nitrocellulose using the Milliblot-SDE electro-blotting system (Millipore)
according to manufacturer's directions. The nitrocellulose was temporarily stained with 10%
Ponceau S [S.B. Carroll and A. Laughon, "Production and Purification of Polyclonal
Antibodies to the Foreign Segment of β-galactosidase Fusion Proteins," in DNA Cloning: A
Practical Approach, Vol. III, (D. Glover, ed.), pp. 89-111, IRL Press, Oxford, (1987)] to
visualize the lanes, then destained by running a gentle stream of distilled water over the blot
for several minutes. The nitrocellulose was immersed in PBS containing 3% BSA overnight
at to block any remaining protein binding sites.

The blot was cut into strips and each strip was incubated with the appropriate primary
antibody. The avian anti-C. botulinum antibodies [described in (c)] and pre-immune chicken
antibody (as control) were diluted 1:125 in PBS containing 1 mg/ml BSA and incubated with
the strips for 2 hours at room temperature. The blots were washed with two changes each of
large volumes of PBS, BBS-Tween and PBS, successively (10 min/wash). Goat anti-chicken
IgG alkaline phosphatase conjugated secondary antibody (Fisher Biotech) was diluted 1:500
in PBS containing 1 mg/ml BSA and incubated with the blot for 2 hours at room temperature.
The blots were washed with two changes each of large volumes of PBS and BBS-Tween,
followed by one change of PBS and 0.1 M Tris-HCl, pH 9.5. Blots were developed in freshly
prepared alkaline phosphatase substrate buffer (100 µg/ml nitroblue tetrazolium (Sigma),
50 µg/ml 5-bromo-4-chloro-3-indolyl phosphate (Sigma), 5 mM MgCl2 in 50 mM Na2CO3,
pH 9.5).

The Western blots are shown in Figure 1. The anti-C. botulinum IgY reacted to the
toxoid to give a broad immunoreactive band at about 145-150 kD on the reducing gel. This
toxoid is refractive to disulfide cleavage by reducing agents due to formalin crosslinking. The
immune IgY reacted with the active toxin complex, a 97 kD C. botulinum type A heavy chain
and a 53 kD light chain. The preimmune IgY was unreactive to the C. botulinum complex or
toxoid in the Western blot.

e) Antitoxin Antibody Titer

The IgY antibody titer to C. botulinum type A toxoid of eggs harvested between day
409 and 423 was evaluated by ELISA, prepared as follows. Ninety-six-well Falcon Pro-bind
plates were coated overnight at 4°C with 100 µl/well toxoid [B.R. Singh & B.R. DasGupta,
Toxicon 27:403 (1989)] at 2.5 µg/ml in PBS, pH 7.5 containing 0.005% thimerosal. The
following day the wells were blocked with PBS containing 1% BSA for 1 hour at 37°C. The
IgY from immune or preimmune eggs was diluted in PBS containing 1% BSA and 0.05%
Tween 20 and the plates were incubated for 1 hour at 37°C. The plates were washed three
times with PBS containing 0.05% Tween 20 and three times with PBS alone. Alkaline
phosphatase-conjugated goat-anti-chicken IgG (Fisher Biotech) was diluted 1:750 in PBS
containing 1% BSA and 0.05% Tween 20, added to the plates, and incubated 1 hour at 37°C.
The plates were washed as before, and p-nitrophenyl phosphate (Sigma) at 1 mg/ml in 0.05 M
Na2CO3, pH 9.5, 10 mM MgCl2 was added.

The results are shown in Figure 2. Chickens immunized with the toxoid generated
high titers of antibody to the immunogen. Importantly, eggs from both immunized hens had
significant anti-immunogen antibody titers as compared to preimmune control eggs. The anti-
C. botulinum IgY possessed significant activity, to a dilution of 1:93,750 or greater.

EXAMPLE 4 

Preparation Of Avian Egg Yolk Immunoglobulin In An Orally Administrable Form

In order to administer avian IgY antibodies orally to experimental mice, an effective
delivery formula for the IgY had to be determined. The concern was that if the crude IgY
was dissolved in PBS, the saline in PBS would dehydrate the mice, which might prove
harmful over the duration of the study. Therefore, alternative methods of oral administration
of IgY were tested. The example involved: (a) isolation of immune IgY; (b) solubilization
of IgY in water or PBS, including subsequent dialysis of the IgY-PBS solution with water to
eliminate or reduce the salts (salt and phosphate) in the buffer; and (c) comparison of the
quantity and activity of recovered IgY by absorbance at 280 nm and PAGE, and enzyme-
linked immunoassay (ELISA).

a) Isolation Of Immune IgY

In order to investigate the most effective delivery formula for IgY, we used IgY which
was raised against Crotalus durissus terrificus venom. Three eggs were collected from hens
immunized with the C. durissus terrificus venom and IgY was extracted from the yolks using
the modified Polson procedure described by Thalley and Carroll [Bio/Technology, 8:934-938
(1990)] as described in Example 1(c).

The egg yolks were separated from the whites, pooled, and blended with four volumes
of PBS. Powdered PEG 8000 was added to a concentration of 3.5%. The mixture was
centrifuged at 10,000 rpm for 10 minutes to pellet the precipitated protein, and the
supernatant was filtered through cheesecloth to remove the lipid layer. Powdered PEG 8000
was added to the supernatant to bring the final PEG concentration to 12% (assuming a PEG
concentration of 3.5% in the supernatant). The 12% PEG/IgY mixture was divided into two
equal volumes and centrifuged to pellet the IgY.
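
The second PEG addition can be sketched as a short calculation. This is only an illustration: it assumes the percentages are weight/volume (grams per 100 ml) and that adding the powder does not change the volume appreciably, neither of which is stated in the text. The function name and the 100 ml example volume are hypothetical.

```python
def peg_to_add_grams(volume_ml, current_pct, target_pct):
    """Grams of powdered PEG needed to raise a solution from current_pct
    to target_pct, treating percentages as w/v (g per 100 ml) and
    ignoring the small volume change caused by the added powder."""
    if target_pct < current_pct:
        raise ValueError("target concentration is below the current one")
    return (target_pct - current_pct) / 100.0 * volume_ml


# Hypothetical example: 100 ml of supernatant assumed to hold 3.5% PEG,
# brought to the 12% used to precipitate the IgY.
grams = peg_to_add_grams(100.0, 3.5, 12.0)
print(f"{grams:.1f} g PEG 8000 per 100 ml supernatant")
```

Under these assumptions the addition scales linearly with supernatant volume, so the same function covers any batch size.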

b) Solubilization Of The IgY In Water Or PBS

One pellet was resuspended in 1/2 the original yolk volume of PBS, and the other
pellet was resuspended in 1/2 the original yolk volume of water. The pellets were then
centrifuged to remove any particles or insoluble material. The IgY in PBS solution dissolved
readily but the fraction resuspended in water remained cloudy.

In order to satisfy anticipated sterility requirements for orally administered antibodies,
the antibody solution needs to be filter-sterilized (as an alternative to heat sterilization which
would destroy the antibodies). The preparation of IgY resuspended in water was too cloudy
to pass through either a 0.2 or 0.45 µm membrane filter, so 10 ml of the PBS resuspended
fraction was dialyzed overnight at room temperature against 250 ml of water. The following
morning the dialysis chamber was emptied and refilled with 250 ml of fresh H2O for a second
dialysis. Thereafter, the yields of soluble antibody were determined at OD280 and are
compared in Table 7.

TABLE 7

Dependence Of IgY Yield On Solvents

Fraction                       Absorbance Of 1:10 Dilution At 280 nm    Percent Recovery
PBS dissolved                  1.149                                    100%
H2O dissolved                  0.706                                    61%
PBS dissolved/H2O dialyzed     0.885                                    77%



Resuspending the pellets in PBS followed by dialysis against water recovered more
antibody than directly resuspending the pellets in water (77% versus 61%). Equivalent
volumes of the IgY preparation in PBS or water were compared by PAGE, and these results
were in accordance with the absorbance values (data not shown).
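
The recovery percentages quoted above follow directly from the absorbance readings. The sketch below reproduces them, taking the PBS-dissolved fraction as the 100% reference as Table 7 does; the dictionary layout and function name are illustrative choices, not part of the original method.

```python
# Absorbance at 280 nm of a 1:10 dilution, values taken from Table 7.
A280 = {
    "PBS dissolved": 1.149,
    "H2O dissolved": 0.706,
    "PBS dissolved/H2O dialyzed": 0.885,
}


def percent_recovery(absorbance, reference):
    """Recovery relative to the reference fraction, rounded to a whole percent."""
    return round(100.0 * absorbance / reference)


reference = A280["PBS dissolved"]
recovery = {name: percent_recovery(a, reference) for name, a in A280.items()}
for name, pct in recovery.items():
    print(f"{name}: {pct}%")
```

Because every fraction was read at the same 1:10 dilution, the dilution factor cancels and the raw absorbance ratio can be used as is.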



c) Activity Of IgY Prepared With Different Solvents

An ELISA was performed to compare the binding activity of the IgY extracted by
each procedure described above. C. durissus terrificus (C.d.t.) venom at 2.5 µg/ml in PBS
was used to coat each well of a 96-well microtiter plate. The remaining protein binding sites
were blocked with PBS containing 5 mg/ml BSA. Primary antibody dilutions (in PBS
containing 1 mg/ml BSA) were added in duplicate. After 2 hours of incubation at room
temperature, the unbound primary antibodies were removed by washing the wells with PBS,
BBS-Tween, and PBS. The species specific secondary antibody (goat anti-chicken
immunoglobulin alkaline-phosphatase conjugate (Sigma)) was diluted 1:750 in PBS containing
1 mg/ml BSA and added to each well of the microtiter plate. After 2 hours of incubation at
room temperature, the unbound secondary antibody was removed by washing the plate as
before, and freshly prepared alkaline phosphatase substrate (Sigma) at 1 mg/ml in 50 mM
Na2CO3, 10 mM MgCl2, pH 9.5 was added to each well. The color development was
measured on a Dynatech MR 700 microplate reader using a 412 nm filter. The results are
shown in Table 8.

The binding assay results parallel the recovery values in Table 7, with PBS-dissolved
IgY showing slightly more activity than the PBS-dissolved/H2O dialyzed antibody. The
water-dissolved antibody had considerably less binding activity than the other preparations.






EXAMPLE 5 

Survival Of Antibody Activity After Passage Through The Gastrointestinal Tract

In order to determine the feasibility of oral administration of antibody, it was of
interest to determine whether orally administered IgY survived passage through the
gastrointestinal tract. The example involved: (a) oral administration of specific immune
antibody mixed with a nutritional formula; and (b) assay of antibody activity extracted from
feces.

TABLE 8

Antigen-Binding Activity Of IgY Prepared With Different Solvents

Dilution       Preimmune    PBS Dissolved    H2O Dissolved    PBS/H2O
               0.005        1.748            1.577            1.742
1:2,500        0.004        0.644            0.340            i).(m
1:12,500       0.001        0.144            0.054
1:62,500       0.001        0.025            0.007            0.016
1:312,500      0.010        0.000            0.000            0.002



a) Oral Administration Of Antibody

The IgY preparations used in this example are the same PBS-dissolved/H2O dialyzed
antivenom materials obtained in Example 4 above, mixed with an equal volume of Enfamil®.
Two mice were used in this experiment, each receiving a different diet as follows:

1) water and food as usual;

2) immune IgY preparation dialyzed against water and mixed 1:1 with Enfamil®.
(The mice were given the corresponding mixture as their only source of food and water).

b) Antibody Activity After Ingestion

After both mice had ingested their respective fluids, each tube was refilled with
approximately 10 ml of the appropriate fluid first thing in the morning. By mid-morning
there was about 4 to 5 ml of liquid left in each tube. At this point stool samples were
collected from each mouse, weighed, and dissolved in approximately 500 µl PBS per 100 mg
stool sample. One hundred and sixty mg of control stools (no antibody) and W mg of
experimental stools (specific antibody) in 1.5 ml microfuge tubes were dissolved in 800 and
500 µl PBS, respectively. The samples were heated at 37°C for 10 minutes and vortexed
vigorously. The experimental stools were also broken up with a narrow spatula. Each sample
was centrifuged for 5 minutes in a microfuge and the supernatants, presumably containing the
antibody extracts, were collected. The pellets were saved at 2-8°C in case future extracts
were needed. Because the supernatants were tinted, they were diluted five-fold in PBS
containing 1 mg/ml BSA for the initial dilution in the enzyme immunoassay (ELISA). The
primary extracts were then diluted five-fold serially from this initial dilution. The volume of
primary extract added to each well was 190 µl. The ELISA was performed exactly as
described in Example 4.
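
The five-fold serial dilution scheme described above can be written out as a short sketch. It simply multiplies the cumulative dilution factor by five at each step, starting from the initial 1:5 dilution; the function name and the choice of five steps are illustrative assumptions.

```python
def serial_dilutions(initial_fold, step_fold, count):
    """Cumulative dilution factors for a serial dilution series.

    initial_fold  dilution factor of the first sample (5 means 1:5)
    step_fold     fold dilution applied at each subsequent step
    count         number of dilutions in the series
    """
    factors = []
    fold = initial_fold
    for _ in range(count):
        factors.append(fold)
        fold *= step_fold
    return factors


series = serial_dilutions(initial_fold=5, step_fold=5, count=5)
labels = [f"1:{f}" for f in series]
print(labels)
```

Setting `initial_fold=1` would give a series that starts from an undiluted sample instead.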



TABLE 9

Specific Antibody Activity After Passage Through The Gastrointestinal Tract

Dilution    Preimmune IgY    Control Fecal Extract    Exp. Fecal Extract
1:5         0                0.000                    0.032
1:25        0.016                                     0.016
1:125       0                0                        0.009
1:625       0                0.001                    0.001
1:3125      0                0                        0.000



There was some active antibody in the fecal extract from the mouse given the specific
antibody in Enfamil® formula, but it was present at a very low level. Since the samples were
assayed at an initial 1:5 dilution, the binding observed could have been higher with less dilute
samples. Consequently, the mice were allowed to continue ingesting either regular food and
water or the specific IgY in Enfamil® formula, as appropriate, so the assay could be repeated.
Another ELISA plate was coated overnight with 5 µg/ml of C.d.t. venom in PBS.

The following morning the ELISA plate was blocked with 5 mg/ml BSA, and the fecal
samples were extracted as before, except that instead of heating the extracts at 37°C, the
samples were kept on ice to limit proteolysis. The samples were assayed undiluted initially,
and in 5X serial dilutions thereafter. Otherwise the assay was carried out as before.

TABLE 10

Specific Antibody Survives Passage Through The Gastrointestinal Tract

Dilution     Preimmune IgY    Control Extract    Exp. Extract
undiluted    ().(>03          0                  0.379
1:5          -0               0                  0.071
1:25         0.000            0                  0.027
1:125        0.003            0                  0.017
1:625        0.000            0                  0.008
1:3125       0.002            0                  0.002



The experiment confirmed the previous results, with the antibody activity markedly
higher. The control fecal extract showed no anti-C.d.t. activity, even undiluted, while the
fecal extract from the anti-C.d.t. IgY/Enfamil®-fed mouse showed considerable anti-C.d.t.
activity. This experiment (and the previous experiment) clearly demonstrate that active IgY
antibody survives passage through the mouse digestive tract, a finding with favorable
implications for the success of IgY antibodies administered orally as a therapeutic or
prophylactic.
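
One way to summarize a dilution series like the one in Table 10 is to report the last dilution at which the experimental signal still clearly exceeds the control. The sketch below does this for the experimental and control extract columns. The cutoff of 0.010 absorbance units is a hypothetical threshold chosen only for illustration; the text does not define an endpoint criterion, and the function name is likewise an assumption.

```python
# Absorbance readings from Table 10 (experimental and control fecal extracts).
DILUTIONS = ["undiluted", "1:5", "1:25", "1:125", "1:625", "1:3125"]
EXPERIMENTAL = [0.379, 0.071, 0.027, 0.017, 0.008, 0.002]
CONTROL = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0]


def endpoint_dilution(dilutions, sample, control, cutoff):
    """Return the last dilution, scanning from least to most dilute, at
    which the sample signal exceeds the control signal by more than
    `cutoff`. Scanning stops at the first dilution that fails the test.
    Returns None if even the first dilution does not pass."""
    endpoint = None
    for label, s, c in zip(dilutions, sample, control):
        if s - c > cutoff:
            endpoint = label
        else:
            break
    return endpoint


# The 0.010 cutoff is a hypothetical choice, not a value from the text.
endpoint = endpoint_dilution(DILUTIONS, EXPERIMENTAL, CONTROL, cutoff=0.010)
print(endpoint)
```

A stricter or looser cutoff shifts the reported endpoint, so the number is only meaningful alongside the criterion used.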

EXAMPLE 6

In Vivo Neutralization Of C. botulinum
Type A Neurotoxin By Avian Antitoxin Antibody

This example demonstrated the ability of PEG-purified antitoxin, collected as
described in Example 3, to neutralize the lethal effect of C. botulinum neurotoxin type A in
mice. To determine the oral lethal dose (LD100) of toxin A, groups of BALB/c mice were
given different doses of toxin per unit body weight (average body weight of 24 grams). For
oral administration, toxin A complex, which contains the neurotoxin associated with other
non-toxin proteins, was used. This complex is markedly more toxic than purified neurotoxin
when given by the oral route. [I. Ohishi et al., Infect. Immun., 16:106 (1977).] C. botulinum
toxin type A complex, obtained from Eric Johnson (University Of Wisconsin, Madison) was
250 µg/ml in 50 mM sodium citrate, pH 5.5, specific toxicity 3 x 10' mouse LD50/mg with
parenteral administration. Approximately 40-50 ng/gm body weight was usually fatal within
48 hours in mice maintained on conventional food and water. When mice were given a diet
ad libitum of only Enfamil® the concentration needed to produce lethality was approximately
2.5 times higher (125 ng/gm body weight). Botulinal toxin concentrations of approximately
200 ng/gm body weight were fatal in mice fed Enfamil® containing preimmune IgY
(resuspended in Enfamil® at the original yolk volume).

The oral LD100 of C. botulinum toxin was also determined in mice that received
known amounts of a mixture of preimmune IgY-Ensure® delivered orally through feeding
needles. Using a 22 gauge feeding needle, mice were given 250 µl each of a preimmune
IgY-Ensure® mixture (preimmune IgY dissolved in 1/4 original yolk volume) 1 hour before
and 1/2 hour and 5 hours after administering botulinal toxin. Toxin concentrations given
orally ranged from approximately 12 to 312 ng/gm body weight (0.3 to 7.5 µg per mouse).
Botulinal toxin complex concentration of approximately 40 ng/gm body weight (1 µg per
mouse) was lethal in all mice in less than 36 hours.
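
The per-mouse amounts quoted in parentheses follow from the weight-normalized doses and the 24 gram average body weight stated earlier in this example. A minimal sketch of that conversion is given below; the function name is illustrative, and the text's figures are rounded approximations of the values it returns.

```python
AVERAGE_BODY_WEIGHT_G = 24.0  # average mouse body weight given in the text


def dose_per_mouse_ug(dose_ng_per_g, body_weight_g=AVERAGE_BODY_WEIGHT_G):
    """Convert a dose in ng per gram body weight to micrograms per mouse."""
    return dose_ng_per_g * body_weight_g / 1000.0


for dose in (12, 40, 312):
    print(f"{dose} ng/gm -> {dose_per_mouse_ug(dose):.2f} ug per mouse")
```

The same function can be run in reverse by hand: 1 µg in a 24 gram mouse corresponds to roughly 41.7 ng/gm.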

Two groups of BALB/c mice, 10 per group, were each given orally a single dose of 1
µg each of botulinal toxin complex in 100 µl of 50 mM sodium citrate pH 5.5. The mice
received 250 µl treatments of a mixture of either preimmune or immune IgY in Ensure® (1/4
original yolk volume) 1 hour before and 1/2 hour, 4 hours, and 8 hours after botulinal toxin
administration. The mice received three treatments per day for two more days. The mice
were observed for 96 hours. The survival and mortality are shown in Table 11.



TABLE 11

Neutralization Of Botulinal Toxin A In Vivo

Toxin Dose (ng/gm)    Antibody Type           Number Of Mice Alive    Number Of Mice Dead
41.6                  non-immune              0                       10
41.6                  anti-botulinal toxin    10                      0



All mice treated with the preimmune IgY-Ensure® mixture died within 46 hours post-
toxin administration. The average time of death in the mice was 32 hours post toxin
administration. Treatments of preimmune IgY-Ensure® mixture did not continue beyond 24
hours due to extensive paralysis of the mouth in mice of this group. In contrast, all ten mice
treated with the immune anti-botulinal toxin IgY-Ensure® mixture survived past 96 hours.
Only 4 mice in this group exhibited symptoms of botulism toxicity (two mice about 2 days
after and two mice 4 days after toxin administration). These mice eventually died 5 and 6
days later. Six of the mice in this immune group displayed no adverse effects to the toxin
and remained alive and healthy long term. Thus, the avian anti-botulinal toxin antibody
demonstrated very good protection from the lethal effects of the toxin in the experimental
mice.

EXAMPLE 7

Production Of An Avian Antitoxin Against Clostridium difficile Toxin A

Toxin A is a potent cytotoxin secreted by pathogenic strains of C. difficile, that plays a
direct role in damaging gastrointestinal tissues. In more severe cases of C. difficile
intoxication, pseudomembranous colitis can develop which may be fatal. This would be
prevented by neutralizing the effects of this toxin in the gastrointestinal tract. As a first step,
antibodies were produced against a portion of the toxin. The example involved: (a)
conjugation of a synthetic peptide of toxin A to bovine serum albumin; (b) immunization of
hens with the peptide-BSA conjugate; and (c) detection of antitoxin peptide antibodies by
ELISA.

a) Conjugation Of A Synthetic Peptide Of Toxin A To Bovine
Serum Albumin

The synthetic peptide CQTIDGKKYYFN-NH2 (SEQ ID NO:82) was prepared
commercially (Multiple Peptide Systems, San Diego, CA) and validated to be >80% pure by
high-pressure liquid chromatography. The eleven amino acids following the cysteine residue
represent a consensus sequence of a repeated amino acid sequence found in toxin A. [Wren
et al., Infect. Immun., 59:3151-3155 (1991).] The cysteine was added to facilitate
conjugation to carrier protein.

In order to prepare the carrier for conjugation, BSA (Sigma) was dissolved in 0.01 M
NaPO4, pH 7.0 to a final concentration of 20 mg/ml and m-maleimidobenzoyl-N-
hydroxysuccinimide ester (MBS; Pierce) was dissolved in N,N-dimethyl formamide to a
concentration of 5 mg/ml. MBS solution, 0.51 ml, was added to 3.25 ml of the BSA solution
and incubated for 30 minutes at room temperature with stirring every 5 minutes. The MBS-
activated BSA was then purified by chromatography on a Bio-Gel P-10 column (Bio-Rad; 40
ml bed volume) equilibrated with 50 mM NaPO4, pH 7.0 buffer. Peak fractions were pooled
(6.0 ml).

Lyophilized toxin A peptide (20 mg) was added to the activated BSA mixture, stirred
until the peptide dissolved and incubated 3 hours at room temperature. Within 20 minutes,
the reaction mixture became cloudy and precipitates formed. After 3 hours, the reaction
mixture was centrifuged at 10,000 x g for 10 min and the supernatant analyzed for protein
content. No significant protein could be detected at 280 nm. The conjugate precipitate was
washed three times with PBS and stored at 4°C. A second conjugation was performed with
15 mg of activated BSA and 5 mg of peptide and the conjugates pooled and suspended at a
peptide concentration of 10 mg/ml in 10 mM NaPO4, pH 7.2.

b) Immunization Of Hens With Peptide Conjugate

Two hens were each initially immunized on day zero by injection into two
subcutaneous and two intramuscular sites with 1 mg of peptide conjugate that was emulsified
in CFA (GIBCO). The hens were boosted on day 14 and day 21 with 1 mg of peptide
conjugate emulsified in IFA (GIBCO).

c) Detection Of Antitoxin Peptide Antibodies By ELISA

IgY was purified from two eggs obtained before immunization (pre-immune) and two
eggs obtained 31 and 32 days after the initial immunization using PEG fractionation as
described in Example 1.

Wells of a 96-well microtiter plate (Falcon Pro-Bind Assay Plate) were coated
overnight at 4°C with 100 µg/ml solution of the toxin A synthetic peptide in PBS, pH 7.2
prepared by dissolving 1 mg of the peptide in 1.0 ml of H2O and diluting in PBS. The pre-
immune and immune IgY preparations were diluted in a five-fold series in a buffer containing
1% PEG 8000 and 0.1% Tween-20 (v/v) in PBS, pH 7.2. The wells were blocked for 2
hours at room temperature with 150 µl of a solution containing 5% (v/v) Carnation® nonfat
dry milk and 1% PEG 8000 in PBS, pH 7.2. After incubation for 2 hours at room
temperature, the wells were washed, secondary rabbit anti-chicken IgG-alkaline phosphatase
(1:750) added, the wells washed again and the color development obtained as described in
Example 1. The results are shown in Table 12.



TABLE 12

Reactivity Of IgY With Toxin Peptide

                         Absorbance At 410 nm
Dilution Of PEG Prep     Preimmune    Immune Anti-Peptide
1:100                    0.013        0.253
1:500                    0.004        0.039
1:2500                   0.004        0.005



Clearly, the immune antibodies contain titers against this repeated epitope of toxin A.

EXAMPLE 8

Production Of Avian Antitoxins Against
Clostridium difficile Native Toxins A And B

To determine whether avian antibodies are effective for the neutralization of C.
difficile toxins, hens were immunized using native C. difficile toxins A and B. The resulting
egg yolk antibodies were then extracted and assessed for their ability to neutralize toxins A
and B in vitro. The Example involved (a) preparation of the toxin immunogens, (b)
immunization, (c) purification of the antitoxins, and (d) assay of toxin neutralization activity.

a) Preparation Of The Toxin Immunogens

Both C. difficile native toxins A and B, and C. difficile toxoids, prepared by the
treatment of the native toxins with formaldehyde, were employed as immunogens. C. difficile
toxoids A and B were prepared by a procedure which was modified from published methods
(Ehrich et al., Infect. Immun. 28:1041 (1980)). Separate solutions (in PBS) of native C.
difficile toxin A and toxin B (Tech Lab) were each adjusted to a concentration of 0.20 mg/ml,
and formaldehyde was added to a final concentration of 0.4%. The toxin/formaldehyde
solutions were then incubated at 37°C for 40 hrs. Free formaldehyde was then removed from
the resulting toxoid solutions by dialysis against PBS at 4°C. In previously published reports,
this dialysis step was not performed. Therefore, free formaldehyde must have been present in
their toxoid preparations. The toxoid solutions were concentrated, using a Centriprep
concentrator unit (Amicon), to a final toxoid concentration of 4.0 mg/ml. The two resulting
preparations were designated as toxoid A and toxoid B.

C. difficile native toxins were prepared by concentrating stock solutions of toxin A and
toxin B (Tech Lab, Inc), using Centriprep concentrator units (Amicon), to a final
concentration of 4.0 mg/ml.

b) Immunization

The first two immunizations were performed using the toxoid A and toxoid B
immunogens described above. A total of 3 different immunization combinations were
employed. For the first immunization group, 0.2 ml of toxoid A was emulsified in an equal
volume of Titer Max adjuvant (CytRx). Titer Max was used in order to conserve the amount
of immunogen used, and to simplify the immunization procedure. This immunization group
was designated "CTA." For the second immunization group, 0.1 ml of toxoid B was
emulsified in an equal volume of Titer Max adjuvant. This group was designated "CTB."
For the third immunization group, 0.2 ml of toxoid A was first mixed with 0.2 ml of toxoid
B, and the resulting mixture was emulsified in 0.4 ml of Titer Max adjuvant. This group was
designated "CTAB." In this way, three separate immunogen emulsions were prepared, with
each emulsion containing a final concentration of 2.0 mg/ml of toxoid A (CTA) or toxoid B
(CTB) or a mixture of 2.0 mg/ml toxoid A and 2.0 mg/ml toxoid B (CTAB).

On day 0, White Leghorn hens, obtained from a local breeder, were immunized as
follows: Group CTA. Four hens were immunized, with each hen receiving 200 µg of toxoid
A, via two intramuscular (I.M.) injections of 50 µl of CTA emulsion in the breast area.
Group CTB. One hen was immunized with 200 µg of toxoid B, via two I.M. injections of
50 µl of CTB emulsion in the breast area. Group CTAB. Four hens were immunized, with
each hen receiving a mixture containing 200 µg of toxoid A and 200 µg of toxoid B, via two
I.M. injections of 100 µl of CTAB emulsion in the breast area. The second immunization was
performed 5 weeks later on day 35, exactly as described for the first immunization above.
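
The per-hen doses above can be checked from the emulsion recipes. The sketch below is a rough illustration that assumes the 4.0 mg/ml toxoid stocks from section (a), volumes that add on mixing, and toxoid distributed evenly through the emulsion. The function and parameter names are hypothetical.

```python
def dose_per_hen_ug(stock_mg_per_ml, antigen_ml, total_emulsion_ml,
                    injection_ul, injections):
    """Micrograms of one antigen delivered to a hen.

    stock_mg_per_ml    concentration of the antigen stock
    antigen_ml         volume of that stock used in the emulsion
    total_emulsion_ml  total emulsion volume (all antigens plus adjuvant)
    injection_ul       volume of each injection
    injections         number of injections per hen
    """
    conc_mg_per_ml = stock_mg_per_ml * antigen_ml / total_emulsion_ml
    # mg/ml is numerically equal to ug/ul, so ug = (ug/ul) * ul
    return conc_mg_per_ml * injection_ul * injections


# CTA: 0.2 ml toxoid A + 0.2 ml adjuvant, two 50 ul injections
cta = dose_per_hen_ug(4.0, 0.2, 0.4, 50, 2)
# CTB: 0.1 ml toxoid B + 0.1 ml adjuvant, two 50 ul injections
ctb = dose_per_hen_ug(4.0, 0.1, 0.2, 50, 2)
# CTAB: 0.2 ml of each toxoid + 0.4 ml adjuvant, two 100 ul injections
ctab_each = dose_per_hen_ug(4.0, 0.2, 0.8, 100, 2)
print(cta, ctb, ctab_each)
```

Under these assumptions each group works out to 200 µg of each toxoid per hen, in line with the doses stated in the text.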

In order to determine whether hens previously immunized with C. difficile toxoids
could tolerate subsequent booster immunizations using native toxins, a single hen from group
CTAB was immunized for a third time, this time using a mixture of the native toxin A and
native toxin B described in section (a) above (these toxins were not formaldehyde-treated, and
were used in their active form). This was done in order to increase the amount (titer) and
affinity of specific antitoxin antibody produced by the hen over that achieved by immunizing
with toxoids only. On day 62, 0.1 ml of a toxin mixture was prepared which contained
200 µg of native toxin A and 200 µg of native toxin B. This toxin mixture was then
emulsified in 0.1 ml of Titer Max adjuvant. A single CTAB hen was then immunized with
the resulting immunogen emulsion, via two I.M. injections of each, into the breast area.
This hen was marked with a wing band, and observed for adverse effects for a period of
approximately 1 week, after which time the hen appeared to be in good health.

Because the CTAB hen described above tolerated the booster immunization with native
toxins A and B with no adverse effects, it was decided to boost the remaining hens with
native toxin as well. On day 70, booster immunizations were performed as follows: Group
CTA. A 0.2 ml volume of the 4 mg/ml native toxin A solution was emulsified in an equal
volume of Titer Max adjuvant. Each of the 4 hens was then immunized with 200 µg of native
toxin A, as described for the toxoid A immunizations above. Group CTB. A 50 µl volume
of the 4 mg/ml native toxin B solution was emulsified in an equal volume of Titer Max
adjuvant. The hen was then immunized with 200 µg of native toxin B, as described for the
toxoid B immunizations above. Group CTAB. A 0.15 ml volume of the 4 mg/ml native
toxin A solution was first mixed with a 0.15 ml volume of the 4 mg/ml native toxin B solution.
The resulting toxin mixture was then emulsified in 0.3 ml of Titer Max adjuvant. The 3
remaining hens (the hen with the wing band was not immunized this time) were then
immunized with 200 µg of native toxin A and 200 µg of native toxin B as described for the
toxoid A + toxoid B immunizations (CTAB) above. On day 85, all hens received a second
booster immunization using native toxins, done exactly as described for the first boost with
native toxins above.

All hens tolerated both booster immunizations with native toxins with no adverse
effects. As previous literature references describe the use of formaldehyde-treated toxoids,
this is apparently the first time that any immunizations have been performed using native C.
difficile toxins.

15 

c) Purification Of Antitoxins 

Eggs were collected from the hen in group CTB 10-12 days following the second
immunization with toxoid (day 35 immunization described in section (b) above), and from the
hens in groups CTA and CTAB 20-21 days following the second immunization with toxoid.
To be used as a pre-immune (negative) control, eggs were also collected from unimmunized
hens from the same flock. Egg yolk immunoglobulin (IgY) was extracted from the 4 groups
of eggs as described in Example 1 (c), and the final IgY pellets were solubilized in the
original yolk volume of PBS without thimerosal. Importantly, thimerosal was excluded
because it would have been toxic to the CHO cells used in the toxin neutralization assays
described in section (d) below.

d) Assay Of Toxin Neutralization Activity 

The toxin neutralization activity of the IgY solutions prepared in section (c) above was
determined using an assay system that was modified from published methods. [Ehrich et al.,
Infect. Immun. 28:1041-1043 (1992); and McGee et al., Microb. Path. 12:333-341 (1992).]
As additional controls, affinity-purified goat anti-C. difficile toxin A (Tech Lab) and
affinity-purified goat anti-C. difficile toxin B (Tech Lab) were also assayed for toxin
neutralization activity.

The IgY solutions and goat antibodies were serially diluted using F12 medium
(GIBCO) which was supplemented with 2% FCS (GIBCO) (this solution will be referred to as
"medium" for the remainder of this Example). The resulting antibody solutions were then
mixed with a standardized concentration of either native C. difficile toxin A (Tech Lab), or
native C. difficile toxin B (Tech Lab), at the concentrations indicated below. Following
incubation at 37°C for 60 min., 100 µl volumes of the toxin + antibody mixtures were added
to the wells of 96-well microtiter plates (Falcon Microtest III) which contained 2.5 x 10^
Chinese Hamster Ovary (CHO) cells per well (the CHO cells were plated on the previous day
to allow them to adhere to the plate wells). The final concentration of toxin, or dilution of
antibody indicated below refers to the final test concentration of each reagent present in the
respective microtiter plate wells. Toxin reference wells were prepared which contained CHO
cells and toxin A or toxin B at the same concentration used for the toxin plus antibody
mixtures (these wells contained no antibody). Separate control wells were also prepared
which contained CHO cells and medium only. The assay plates were then incubated for 18-24
hrs. in a 37°C, humidified, 5% CO2 incubator. On the following day, the remaining
adherent (viable) cells in the plate wells were stained using 0.2% crystal violet (Mallinckrodt)
dissolved in 2% ethanol, for 10 min. Excess stain was then removed by rinsing with water,
and the stained cells were solubilized by adding 100 µl of 1% SDS (dissolved in water) to
each well. The absorbance of each well was then measured at 570 nm, and the percent
cytotoxicity of each test sample or mixture was calculated using the following formula:



% CHO Cell Cytotoxicity = [1 - (Abs. Sample / Abs. Control)] x 100
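
The calculation described here expresses cytotoxicity as the fractional loss of crystal violet
absorbance relative to the control wells. A minimal sketch follows, assuming the formula is
one minus the sample-to-control absorbance ratio, times 100; the function name and the
absorbance readings are hypothetical and are not data from this Example.

```python
def percent_cytotoxicity(abs_sample, abs_control):
    """Percent CHO cell cytotoxicity from absorbance readings at 570 nm."""
    if abs_control <= 0:
        raise ValueError("control absorbance must be positive")
    return (1.0 - abs_sample / abs_control) * 100.0

# Hypothetical readings: a medium-only control well and two test wells.
control = 1.20
half_killed = percent_cytotoxicity(0.60, control)   # half of the stain retained
fully_protected = percent_cytotoxicity(1.20, control)  # no loss of adherent cells
print(half_killed, fully_protected)
```

A well that retains as much stain as the control scores 0% cytotoxicity, which is how
complete neutralization by an antitoxin would appear in this readout.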

Unlike previous reports which quantitate results visually by counting cell rounding by
microscopy, this Example utilized spectrophotometric methods to quantitate the C. difficile
toxin bioassay. In order to determine the toxin A neutralizing activity of the CTA, CTAB,
and pre-immune IgY preparations, as well as the affinity-purified goat antitoxin A control,
dilutions of these antibodies were reacted against a 0.1 µg/ml concentration of native toxin A
(this is the approx. 50% cytotoxic dose of toxin A in this assay system). The results are
shown in Figure 3.

Complete neutralization of toxin A occurred with the CTA IgY (antitoxin A, above) at
dilutions of 1:80 and lower, while significant neutralization occurred out to the 1:320 dilution.
The CTAB IgY (antitoxin A + toxin B, above) demonstrated complete neutralization at the
1:320-1:160 and lower dilutions, and significant neutralization occurred out to the 1:1280
dilution. The commercially available affinity-purified goat antitoxin A did not completely
neutralize toxin A at any of the dilutions tested, but demonstrated significant neutralization
out to a dilution of 1:1,280. The preimmune IgY did not show any toxin A neutralizing
activity at any of the concentrations tested. These results demonstrate that IgY purified from
eggs laid by hens immunized with toxin A alone, or simultaneously with toxin A and toxin B,
is an effective toxin A antitoxin.

The toxin B neutralizing activity of the CTAB and pre-immune IgY preparations, and
also the affinity-purified goat antitoxin B control, was determined by reacting dilutions of
these antibodies against a concentration of native toxin B of 0.1 ng/ml (approximately the
50% cytotoxic dose of toxin B in the assay system). The results are shown in Figure 4.

Complete neutralization of toxin B occurred with the CTAB IgY (antitoxin A + toxin
B, above) at the 1:40 and lower dilutions, and significant neutralization occurred out to the
1:320 dilution. The affinity-purified goat antitoxin B demonstrated complete neutralization at
dilutions of 1:640 and lower, and significant neutralization occurred out to a dilution of
1:2,560. The preimmune IgY did not show any toxin B neutralizing activity at any of the
concentrations tested. These results demonstrate that IgY purified from eggs laid by hens
immunized simultaneously with toxin A and toxin B is an effective toxin B antitoxin.

In a separate study, the toxin B neutralizing activity of CTB, CTAB, and pre-immune
IgY preparations was determined by reacting dilutions of these antibodies against a native
toxin B concentration of 0.1 µg/ml (approximately 100% cytotoxic dose of toxin B in this
assay system). The results are shown in Figure 5.

Significant neutralization of toxin B occurred with the CTB IgY (antitoxin B, above)
at dilutions of 1:80 and lower, while the CTAB IgY (antitoxin A + toxin B, above) was found
to have significant neutralizing activity at dilutions of 1:40 and lower. The preimmune IgY
did not show any toxin B neutralizing activity at any of the concentrations tested. These
results demonstrate that IgY purified from eggs laid by hens immunized with toxin B alone,
or simultaneously with toxin A and toxin B, is an effective toxin B antitoxin.

EXAMPLE 9

In vivo Protection Of Golden Syrian Hamsters From
C. difficile Disease By Avian Antitoxins Against C. difficile Toxins A And B

The most extensively used animal model to study C. difficile disease is the hamster.
[Lyerly et al., Infect. Immun. 47:349-352 (1985).] Several other animal models for
antibiotic-induced diarrhea exist, but none mimic the human form of the disease as closely as
the hamster model. [R. Fekety, "Animal Models of Antibiotic-Induced Colitis," in O. Zak and
M. Sande (eds.), Experimental Models in Antimicrobial Chemotherapy, Vol. 2, pp. 61-72
(1986).] In this model, the animals are first predisposed to the disease by the oral
administration of an antibiotic, such as clindamycin, which alters the population of
normally-occurring gastrointestinal flora (Fekety, at 61-72). Following the oral challenge of
these animals with viable C. difficile organisms, the hamsters develop cecitis, and hemorrhage,
ulceration, and inflammation are evident in the intestinal mucosa. [Lyerly et al., Infect.
Immun. 47:349-352 (1985).] The animals become lethargic, develop severe diarrhea, and a
high percentage of them die from the disease. [Lyerly et al., Infect. Immun. 47:349-352
(1985).] This model is therefore ideally suited for the evaluation of therapeutic agents
designed for the treatment or prophylaxis of C. difficile disease.

The ability of the avian C. difficile antitoxins, described in Example 1 above, to
protect hamsters from C. difficile disease was evaluated using the Golden Syrian hamster
model of C. difficile infection. The Example involved (a) preparation of the avian C. difficile
antitoxins, (b) in vivo protection of hamsters from C. difficile disease by treatment with avian
antitoxins, and (c) long-term survival of treated hamsters.

a) Preparation Of The Avian C. difficile Antitoxins

Eggs were collected from hens in groups CTA and CTAB described in Example 1 (b)
above. To be used as a pre-immune (negative) control, eggs were also purchased from a local
supermarket. Egg yolk immunoglobulin (IgY) was extracted from the 3 groups of eggs as
described in Example 1 (c), and the final IgY pellets were solubilized in one fourth the
original yolk volume of Ensure® nutritional formula.

b) In vivo Protection Of Hamsters Against C. difficile Disease By
Treatment With Avian Antitoxins

The avian C. difficile antitoxins prepared in section (a) above were evaluated for their
ability to protect hamsters from C. difficile disease using an animal model system which was
modified from published procedures. [Fekety, at 61-72; Borriello et al., J. Med. Microbiol.,
24:53-64 (1987); Kim et al., Infect. Immun., 55:2984-2992 (1987); Borriello et al., J. Med.
Microbiol., 25:191-196 (1988); Delmee and Avesani, J. Med. Microbiol., 33:85-90 (1990);
and Lyerly et al., Infect. Immun., 59:2215-2218 (1991).] For the study, three separate
experimental groups were used, with each group consisting of 7 female Golden Syrian
hamsters (Charles River), approximately 10 weeks old and weighing approximately 100 gms.
each. The three groups were designated "CTA," "CTAB" and "Pre-immune." These
designations corresponded to the antitoxin preparations with which the animals in each group
were treated. Each animal was housed in an individual cage, and was offered food and water
ad libitum through the entire length of the study. On day 1, each animal was orally
administered 1.0 ml of one of the three antitoxin preparations (prepared in section (a) above)
at the following timepoints: 0 hrs., 4 hrs., and 8 hrs. On day 2, the day 1 treatment was
repeated. On day 3, at the 0 hr. timepoint, each animal was again administered antitoxin, as
described above. At 1 hr., each animal was orally administered 3.0 mg of clindamycin-HCl
(Sigma) in 1 ml of water. This treatment predisposed the animals to infection with C.
difficile. As a control for possible endogenous C. difficile colonization, an additional animal
from the same shipment (untreated) was also administered 3.0 mg of clindamycin-HCl in the
same manner. This clindamycin control animal was left untreated (and uninfected) for the
remainder of the study. At the 4 hr. and 8 hr. timepoints, the animals were administered
antitoxin as described above. On day 4, at the 0 hr. timepoint, each animal was again
administered antitoxin as described above. At 1 hr., each animal was orally challenged with 1
ml of C. difficile inoculum, which contained approx. 100 C. difficile strain 43596 organisms
in sterile saline. C. difficile strain 43596, which is a serogroup C strain, was chosen because
it is representative of one of the most frequently-occurring serogroups isolated from patients
with antibiotic-associated pseudomembranous colitis. [Delmee et al., J. Clin. Microbiol.,
28:2210-2214 (1990).] In addition, this strain has been previously demonstrated to be virulent
in the hamster model of infection. [Delmee and Avesani, J. Med. Microbiol., 33:85-90
(1990).] At the 4 hr. and 8 hr. timepoints, the animals were administered antitoxin as
described above. On days 5 through 13, the animals were administered antitoxin 3x per day
as described for day 1 above, and observed for the onset of diarrhea and death. On the
morning of day 14, the final results of the study were tabulated. These results are shown in
Table 13.

Representative animals from those that died in the Pre-Immune and CTA groups were
necropsied. Viable C. difficile organisms were cultured from the ceca of these animals, and
the gross pathology of the gastrointestinal tracts of these animals was consistent with that
expected for C. difficile disease (inflamed, distended, hemorrhagic cecum, filled with watery
diarrhea-like material). In addition, the clindamycin control animal remained healthy
throughout the entire study period, therefore indicating that the hamsters used in the study had
not previously been colonized with endogenous C. difficile organisms prior to the start of the
study. Following the final antitoxin treatment on day 13, a single surviving animal from the
CTA group, and also from the CTAB group, was sacrificed and necropsied. No pathology
was noted in either animal.



TABLE 13

Treatment Results

Treatment Group                      No. Animals Surviving    No. Animals Dead
Pre-Immune                           1                        6
CTA (Antitoxin A only)               5                        2
CTAB (Antitoxin A + Antitoxin B)     7                        0



Treatment of hamsters with orally-administered toxin A and toxin B antitoxin (group
CTAB) successfully protected 7 out of 7 (100%) of the animals from C. difficile disease.
Treatment of hamsters with orally-administered toxin A antitoxin (group CTA) protected 5
out of 7 (71%) of these animals from C. difficile disease. Treatment using pre-immune IgY
was not protective against C. difficile disease, as only 1 out of 7 (14%) of these animals
survived. These results demonstrate that the avian toxin A antitoxin and the avian toxin A +
toxin B antitoxin effectively protected the hamsters from C. difficile disease. These results
also suggest that although the neutralization of toxin A alone confers some degree of
protection against C. difficile disease, in order to achieve maximal protection, simultaneous
antitoxin A and antitoxin B activity is necessary.
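
The survival percentages quoted above are simple proportions of the 7 animals in each group.
A short sketch of that arithmetic follows; the function name is hypothetical, and rounding to
the nearest whole percent is an assumption based on the figures quoted in the text.

```python
def percent_surviving(survivors, group_size):
    """Share of a treatment group alive at the end of the study, as a whole percent."""
    return round(100.0 * survivors / group_size)

# Survivors out of 7 animals per group, as reported in the text.
GROUP_SIZE = 7
survivors = {"CTAB": 7, "CTA": 5, "Pre-immune": 1}
rates = {name: percent_surviving(n, GROUP_SIZE) for name, n in survivors.items()}
for name, rate in rates.items():
    print(name, rate)
```

With only 7 animals per group, each animal shifts the result by about 14 percentage points,
so the percentages are best read alongside the raw counts.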



c) Long-Term Survival Of Treated Hamsters

It has been previously reported in the literature that hamsters treated with orally-
administered bovine antitoxin IgG concentrate are protected from C. difficile disease as long
as the treatment is continued, but when the treatment is stopped, the animals develop diarrhea
and subsequently die within 72 hrs. [Lyerly et al., Infect. Immun., 59(6):2215-2218 (1991).]

In order to determine whether treatment of C. difficile disease using avian antitoxins
promotes long-term survival following the discontinuation of treatment, the 4 surviving
animals in group CTA, and the 6 surviving animals in group CTAB were observed for a
period of 11 days (264 hrs.) following the discontinuation of antitoxin treatment described in
section (b) above. All hamsters remained healthy through the entire post-treatment period.
This result demonstrates that not only does treatment with avian antitoxin protect against the
onset of C. difficile disease (i.e., it is effective as a prophylactic), it also promotes long-term
survival beyond the treatment period, and thus provides a lasting cure.

EXAMPLE 10 

In vivo Treatment Of Established C. difficile Infection In Golden
Syrian Hamsters With Avian Antitoxins Against C. difficile Toxins A And B

The ability of the avian C. difficile antitoxins, described in Example 8 above, to treat
an established C. difficile infection was evaluated using the Golden Syrian hamster model.
The Example involved (a) preparation of the avian C. difficile antitoxins, (b) in vivo treatment
of hamsters with established C. difficile infection, and (c) histologic evaluation of cecal tissue.

a) Preparation Of The Avian C. difficile Antitoxins

Eggs were collected from hens in group CTAB described in Example 8 (b) above,
which were immunized with C. difficile toxoids and native toxins A and B. Eggs purchased
from a local supermarket were used as a pre-immune (negative) control. Egg yolk
immunoglobulin (IgY) was extracted from the 2 groups of eggs as described in Example 1
(c), and the final IgY pellets were solubilized in one-fourth the original yolk volume of
Ensure® nutritional formula.

b) in vivo Treatment Of Hamsters With Established C. difficile 
Infection 

The avian C. difficile antitoxins prepared in section (a) above were evaluated for the
ability to treat established C. difficile infection in hamsters using an animal model system
which was modified from the procedure which was described for the hamster protection study
in Example 8(b) above.

For the study, four separate experimental groups were used, with each group consisting
of 7 female Golden Syrian hamsters (Charles River), approx. 10 weeks old, weighing
approximately 100 gms. each. Each animal was housed separately, and was offered food and
water ad libitum through the entire length of the study.

On day 1 of the study, the animals in all four groups were each predisposed to C.
difficile infection by the oral administration of 3.0 mg of clindamycin-HCl (Sigma) in 1 ml of
water.

On day 2, each animal in all four groups was orally challenged with 1 ml of
C. difficile inoculum, which contained approximately 100 C. difficile strain 43596 organisms
in sterile saline. C. difficile strain 43596 was chosen because it is representative of one of the
most frequently-occurring serogroups isolated from patients with antibiotic-associated
pseudomembranous colitis. [Delmee et al., J. Clin. Microbiol., 28:2210-2214 (1990).] In
addition, as this was the same C. difficile strain used in all of the previous Examples above, it
was again used in order to provide experimental continuity.

On day 3 of the study (24 hrs. post-infection), treatment was started for two of the
four groups of animals. Each animal of one group was orally administered 1.0 ml of the
CTAB IgY preparation (prepared in section (a) above) at the following timepoints: 0 hrs., 4
hrs., and 8 hrs. The animals in this group were designated "CTAB-24." The animals in the
second group were each orally administered 1.0 ml of the pre-immune IgY preparation (also
prepared in section (a) above) at the same timepoints as for the CTAB group. These animals
were designated "Pre-24." Nothing was done to the remaining two groups of animals on day
3.

On day 4, 48 hrs. post-infection, the treatment described for day 3 above was repeated
for the CTAB-24 and Pre-24 groups, and was initiated for the remaining two groups at the
same timepoints. The final two groups of animals were designated "CTAB-48" and "Pre-48"
respectively.

On days 5 through 9, the animals in all four groups were administered antitoxin or
pre-immune IgY, 3x per day, as described for day 4 above. The four experimental groups are
summarized in Table 14.

TABLE 14

Experimental Treatment Groups

Group Designation    Experimental Treatment
CTAB-24              Infected, treatment w/antitoxin IgY started @ 24 hrs. post-infection.
Pre-24               Infected, treatment w/pre-immune IgY started @ 24 hrs. post-infection.
CTAB-48              Infected, treatment w/antitoxin IgY started @ 48 hrs. post-infection.
Pre-48               Infected, treatment w/pre-immune IgY started @ 48 hrs. post-infection.



All animals were observed for the onset of diarrhea and death through the conclusion
of the study on the morning of day 10. The results of this study are displayed in Table 15.

TABLE 15 

Experimental Outcome - Day 10

Treatment Group    No. Animals Surviving    No. Animals Dead
CTAB-24            6                        1
Pre-24             0                        7
CTAB-48            4                        3
Pre-48             2                        5



Eighty-six percent of the animals which began receiving treatment with antitoxin IgY
at 24 hrs. post-infection (CTAB-24 above) survived, while 57% of the animals treated with
antitoxin IgY starting 48 hrs. post-infection (CTAB-48 above) survived. In contrast, none of
the animals receiving pre-immune IgY starting 24 hrs. post-infection (Pre-24 above) survived,
and only 29% of the animals which began receiving treatment with pre-immune IgY at 48
hrs. post-infection (Pre-48 above) survived through the conclusion of the study. These results
demonstrate that avian antitoxins raised against C. difficile toxins A and B are capable of
successfully treating established C. difficile infections in vivo.

c) Histologic Evaluation Of Cecal Tissue 

In order to further evaluate the ability of the IgY preparations tested in this study to
treat established C. difficile infection, histologic evaluations were performed on cecal tissue
specimens obtained from representative animals from the study described in section (b) above.

Immediately following death, cecal tissue specimens were removed from animals
which died in the Pre-24 and Pre-48 groups. Following the completion of the study, a
representative surviving animal was sacrificed and cecal tissue specimens were removed from
the CTAB-24 and CTAB-48 groups. A single untreated animal from the same shipment as
those used in the study was also sacrificed and a cecal tissue specimen was removed as a
normal control. All tissue specimens were fixed overnight at 4°C in 10% buffered formalin.
The fixed tissues were paraffin-embedded, sectioned, and mounted on glass microscope slides.
The tissue sections were then stained using hematoxylin and eosin (H and E stain), and were
examined by light microscopy.

Upon examination, the tissues obtained from the CTAB-24 and CTAB-48 animals
showed no pathology, and were indistinguishable from the normal control. This observation
provides further evidence for the ability of avian antitoxins raised against C. difficile toxins A
and B to effectively treat established C. difficile infection, and to prevent the pathologic
consequences which normally occur as a result of C. difficile disease.

In contrast, characteristic substantial mucosal damage and destruction was observed in
the tissues of the animals from the Pre-24 and Pre-48 groups which died from C. difficile
disease. Normal tissue architecture was obliterated in these two preparations, as most of the
mucosal layer was observed to have sloughed away, and there were numerous large
hemorrhagic areas containing massive numbers of erythrocytes.

EXAMPLE 11

Cloning And Expression Of C. difficile Toxin A Fragments

The toxin A gene has been cloned and sequenced, and shown to encode a protein of
predicted MW of 308 kd. [Dove et al., Infect. Immun., 58:480-488 (1990).] Given the
expense and difficulty of isolating native toxin A protein, it would be advantageous to use
simple and inexpensive procaryotic expression systems to produce and purify high levels of
recombinant toxin A protein for immunization purposes. Ideally, the isolated recombinant
protein would be soluble in order to preserve native antigenicity, since solubilized inclusion
body proteins often do not fold into native conformations. To allow ease of purification, the
recombinant protein should be expressed to levels greater than 1 mg/liter of E. coli culture.

To determine whether high levels of recombinant toxin A protein can be produced in
E. coli, fragments of the toxin A gene were cloned into various prokaryotic expression
vectors, and assessed for the ability to express recombinant toxin A protein in E. coli. Three
prokaryotic expression systems were utilized. These systems were chosen because they drive
expression of either fusion (pMALc and pGEX2T) or native (pET23a-c) protein to high levels
in E. coli, and allow affinity purification of the expressed protein on a ligand containing
column. Fusion proteins expressed from pGEX vectors bind glutathione agarose beads, and
are eluted with reduced glutathione. pMAL fusion proteins bind amylose resin, and are eluted
with maltose. A poly-histidine tag is present at either the N-terminal (pET16b) or C-terminal
(pET23a-c) end of pET fusion proteins. This sequence specifically binds Ni2+ chelate columns,
and is eluted with imidazole salts. Extensive descriptions of these vectors are available
[Williams et al. (1995) DNA Cloning 2: Expression Systems, Glover and Hames, eds. IRL
Press, Oxford, pp. 15-58], and will not be discussed in detail here. The Example involved (a)
cloning of the toxin A gene, (b) expression of large fragments of toxin A in various
prokaryotic expression systems, (c) identification of smaller toxin A gene fragments that
express efficiently in E. coli, (d) purification of recombinant toxin A protein by affinity
chromatography, and (e) demonstration of functional activity of a recombinant fragment of the
toxin A gene.

a) Cloning Of The Toxin A Gene

A restriction map of the toxin A gene is shown in Figure 6. The encoded protein
contains a carboxy terminal ligand binding region, containing multiple repeats of a
carbohydrate binding domain. [von Eichel-Streiber and Sauerborn, Gene 96:107-113 (1990).]
The toxin A gene was cloned in three pieces, by using either the polymerase chain reaction
(PCR) to amplify specific regions (regions 1 and 2, Figure 6) or by screening a constructed
genomic library for a specific toxin A gene fragment (region 3, Figure 6). The sequences of
the utilized PCR primers are P1: 5' (iGAAATT TACiGTGGACK A IGfGAG 3' (SEQ ID
NO.:1); P2: 5' TCTAGGAAATTGGGTTGT GTTGAA 3' (SEQ ID NO.:2); P3: 5'
GTGGGA f ATAGGATTAGAGC 3' (SEQ ID NO.:3); and P4: 5'
G TAl GTAGGGGTAAAGTAT 3' (SEQ ID NO.:4). These regions were cloned into
prokaryotic expression vectors that express either fusion (pMALc and pGEX2T) or native
(pET23a-c) protein to high levels in E. coli, and allow affinity purification of the expressed
protein on a ligand containing column.

Clostridium difficile VPI strain 10463 was obtained from the ATCC (ATCC #43255)
and grown under anaerobic conditions in brain-heart infusion medium (BBL). High
molecular-weight C. difficile DNA was isolated essentially as described by Wren and
Tabaqchali (1987) J. Clin. Microbiol., 25:2402, except proteinase K and sodium dodecyl
sulfate (SDS) were used to disrupt the bacteria, and cetyltrimethylammonium bromide
precipitation [as described in Ausubel et al., Current Protocols in Molecular Biology (1989)]
was used to remove carbohydrates from the cleared lysate. The integrity and yield of genomic
DNA was assessed by comparison with a serial dilution of uncut lambda DNA after
electrophoresis on an agarose gel.

Fragments 1 and 2 were cloned by PCR, utilizing a proofreading thermostable DNA
polymerase (native pfu polymerase; Stratagene). The high fidelity of this polymerase reduces
the mutation problems associated with amplification by error prone polymerases (e.g., Taq
polymerase). PCR amplification was performed using the indicated PCR primers (Figure 6)
in 50 µl reactions containing 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl2, 200 µM
each dNTP, 0.2 µM each primer, and 50 ng C. difficile genomic DNA. Reactions were
overlaid with 100 µl mineral oil, heated to 94°C for 4 min, 0.5 µl native pfu polymerase
(Stratagene) added, and the reaction cycled 30x at 94°C for 1 min, 50°C for 1 min, 72°C for
4 min, followed by 10 min at 72°C. Duplicate reactions were pooled, chloroform extracted,
and ethanol precipitated. After washing in 70% ethanol, the pellets were resuspended in 50 µl
TE buffer [10 mM Tris-HCl, 1 mM EDTA pH 8.0]. Aliquots of 10 µl each were restriction
digested with either EcoRI/HincII (fragment 1) or EcoRI/PstI (fragment 2), and the
appropriate restriction fragments were gel purified using the Prep-A-Gene kit (BioRad), and
ligated to either EcoRI/SmaI-restricted pGEX2T (Pharmacia) vector (fragment 1), or the
EcoRI/PstI pMALc (New England Biolabs) vector (fragment 2). Both clones are predicted to
produce in-frame fusions with either the glutathione-S-transferase protein (pGEX vector) or
the maltose binding protein (pMAL vector). Recombinant clones were isolated, and
confirmed by restriction digestion, using standard recombinant molecular biology techniques
[Sambrook et al., Molecular Cloning, A Laboratory Manual (1989)], and designated pGA30-
660 and pMA660-1100, respectively (see Figure 6 for description of the clone designations).
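
The thermal profile described for the fragment 1 and 2 amplifications can be written down as
a small data structure, which makes the total programmed hold time easy to check. This is a
sketch only: the constant and function names are hypothetical, the annealing and extension
temperatures are assumed to be 50°C and 72°C, and ramp times between temperatures are
ignored.

```python
# Thermal profile as (temperature in °C, hold time in minutes).
INITIAL_DENATURATION = (94, 4)   # hot start, polymerase added after this hold
CYCLE_STEPS = [(94, 1), (50, 1), (72, 4)]  # denature, anneal, extend
N_CYCLES = 30
FINAL_EXTENSION = (72, 10)

def total_hold_minutes():
    """Sum of programmed hold times; ramping between temperatures is not counted."""
    per_cycle = sum(minutes for _, minutes in CYCLE_STEPS)
    return INITIAL_DENATURATION[1] + N_CYCLES * per_cycle + FINAL_EXTENSION[1]

print(total_hold_minutes())
```

Most of the run time comes from the 4 min extension step repeated over 30 cycles, which is
consistent with amplifying fragments several kilobases long with a proofreading polymerase.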

Fragment 3 was cloned from a genomic library of size selected PstI digested
C. difficile genomic DNA, using standard molecular biology techniques (Sambrook et al.).
Given that the fragment 3 internal PstI site is protected from cleavage in C. difficile genomic
DNA [Price et al., Curr. Microbiol., 16:55-60 (1987)], a 4.7 kb fragment from PstI restricted
C. difficile genomic DNA was gel purified, and ligated to PstI restricted, phosphatase treated
pUC9 DNA. The resulting genomic library was screened with an oligonucleotide primer
specific to fragment 3, and multiple independent clones were isolated. The presence of
fragment 3 in several of these clones was confirmed by restriction digestion, and a clone of
the indicated orientation (Figure 6) was restricted with BamHI/HindIII, the released fragment
purified by gel electrophoresis, and ligated into similarly restricted pET23c expression vector
DNA (Novagen). Recombinant clones were isolated, and confirmed by restriction digestion.
This construct is predicted to create both an in frame fusion with the pET protein
leader sequence, as well as a C-terminal poly-histidine affinity tag, and is designated
pPA1100-2680 (see Figure 6 for the clone designation).

b) Expression Of Large Fragments Of Toxin A In E, cofi 
Protein expression from the three expression constructs made in (a) was induced, and
analyzed by Western blot analysis with an affinity purified, goat polyclonal antiserum directed
against the toxin A toxoid (Tech Lab). The procedures utilized for protein induction, SDS-PAGE,
and Western blot analysis are described in detail in Williams et al. (1995), supra. In
brief, 5 ml 2X YT (16 g tryptone, 10 g yeast extract, 5 g NaCl per liter, pH 7.5) + 100 µg/ml
ampicillin were added to cultures of bacteria (BL21 for pMAL and pGEX plasmids, and
BL21(DE3)LysS for pET plasmids) containing the appropriate recombinant clone, which were
induced to express recombinant protein by addition of IPTG to 1 mM. Cultures were grown
at 37°C, and induced when the cell density reached 0.5 OD600. Induced protein was allowed
to accumulate for two hrs after induction. Protein samples were prepared by pelleting 1 ml
aliquots of bacteria by centrifugation (1 min in a microfuge), and resuspension of the pelleted
bacteria in 150 µl of 2x SDS-PAGE sample buffer [Williams et al. (1995), supra]. The
samples were heated to 95°C for 5 min, then cooled, and 5 or 10 µl aliquots loaded on 7.5%
SDS-PAGE gels. BioRad high molecular weight protein markers were also loaded, to allow
estimation of the MW of identified fusion proteins. After electrophoresis, protein was
detected either generally, by staining gels with Coomassie blue, or specifically, by blotting to
nitrocellulose for Western blot detection of specific immunoreactive protein. Western blots
(performed as described in Example 3) which detect toxin A reactive protein in cell lysates of
induced protein from the three expression constructs are shown in Figure 7. In this figure,
lanes 1-3 contain cell lysates prepared from E. coli strains containing pPA1100-2860 in
BL21(DE3)lysE cells; lanes 4-6 contain cell lysates prepared from E. coli strains containing
pPA1100-2860 in BL21(DE3)lysS cells; lanes 7-9 contain cell lysates prepared from E. coli
strains containing pMA30-660; lanes 10-12 contain cell lysates prepared from E. coli strains
containing pMA660-1100. The lanes were probed with an affinity purified goat antitoxin A
polyclonal antibody (Tech Lab). Control lysates from uninduced cells (lanes 1, 7, and 10)
contain very little immunoreactive material compared to the induced samples in the remaining
lanes. The highest molecular weight band observed for each clone is consistent with the
predicted size of the full length fusion protein.
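
The induction conditions above fix only the final concentrations (1 mM IPTG, 100 µg/ml ampicillin) and the 5 ml culture volume. As a minimal sketch of the dilution arithmetic (C1 x V1 = C2 x V2), the snippet below computes how much stock solution would be added. The stock concentrations (1 M IPTG, 100 mg/ml ampicillin) and the helper name `stock_volume_ul` are assumptions for illustration only; they are not stated in the text.

```python
def stock_volume_ul(final_conc, stock_conc, culture_ml):
    """Volume of stock (in microliters) needed to reach final_conc in culture_ml.

    Uses C1 * V1 = C2 * V2 and ignores the small volume added by the stock itself.
    final_conc and stock_conc must be given in the same units.
    """
    if stock_conc <= final_conc:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc * culture_ml / stock_conc * 1000.0


CULTURE_ML = 5.0                  # from the text: 5 ml 2X YT cultures
IPTG_FINAL_MM = 1.0               # from the text: IPTG to 1 mM
AMP_FINAL_UG_PER_ML = 100.0       # from the text: 100 ug/ml ampicillin

IPTG_STOCK_MM = 1000.0            # ASSUMPTION: 1 M IPTG stock (not in the text)
AMP_STOCK_UG_PER_ML = 100_000.0   # ASSUMPTION: 100 mg/ml ampicillin stock (not in the text)

iptg_ul = stock_volume_ul(IPTG_FINAL_MM, IPTG_STOCK_MM, CULTURE_ML)
amp_ul = stock_volume_ul(AMP_FINAL_UG_PER_ML, AMP_STOCK_UG_PER_ML, CULTURE_ML)

print(f"IPTG stock to add: {iptg_ul:.1f} ul")
print(f"Ampicillin stock to add: {amp_ul:.1f} ul")
```

With different stock concentrations only the two assumed constants change; the final concentrations remain those given in the procedure.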

Each construct directs expression of high molecular weight (HMW) protein that is
reactive with the toxin A antibody. The size of the largest immunoreactive bands from each
sample is consistent with predictions of the estimated MW of the intact fusion proteins. This
demonstrates that the three fusions are in-frame, and that none of the clones contain cloning
artifacts that disrupt the integrity of the encoded fusion protein. However, the Western blot
demonstrates that fusion proteins from the two larger constructs (pGA30-660 and
pPA1100-2680) are highly degraded. Also, expression levels of toxin A proteins from these two
constructs are low, since induced protein bands are not visible by Coomassie staining (not
shown). Several other expression constructs that fuse large sub-regions of the toxin A gene to
either pMALc or pET23a-c expression vectors were constructed and tested for protein
induction. These constructs were made by mixing gel purified restriction fragments, derived
from the expression constructs shown in Figure 6, with appropriately cleaved expression
vectors, ligating, and selecting recombinant clones in which the toxin A restriction fragments
had ligated together and into the expression vector as predicted for in-frame fusions. The
expressed toxin A intervals within these constructs are shown in Figure 8, as well as the
internal restriction sites utilized to make these constructs.

As used herein, the term "interval" refers to any portion (i.e., any segment of the toxin
which is less than the whole toxin molecule) of a clostridial toxin. In a preferred
embodiment, "interval" refers to portions of C. difficile toxins such as toxin A or toxin B. It
is also contemplated that these intervals will correspond to epitopes of immunologic
importance, such as antigens or immunogens against which a neutralizing antibody response is
effected. It is not intended that the present invention be limited to the particular intervals or
sequences described in these Examples. It is also contemplated that sub-portions of intervals
(e.g., an epitope contained within one interval or which bridges multiple intervals) be used as
compositions and in the methods of the present invention.

In all cases, Western blot analysis of each of these constructs with goat antitoxin A
antibody (Tech Lab) detected HMW fusion protein of the predicted size (not shown). This
confirms that the reading frame of each of these clones is not prematurely terminated, and is
fused in the correct frame with the fusion partner. However, the Western blot analysis
revealed that in all cases the induced protein is highly degraded and, as assessed by the
absence of identifiable induced protein bands by Coomassie Blue staining, is expressed only
at low levels. These results suggest that expression of high levels of intact toxin A
recombinant protein is not possible when large regions of the toxin A gene are expressed in
E. coli using these expression vectors.

c) High Level Expression Of Small Toxin A Protein Fusions In 
E. coli

Experience indicates that expression difficulties are often encountered when large
(greater than 100 kd) fragments are expressed in E. coli. A number of expression constructs
containing smaller fragments of the toxin A gene were constructed, to determine if small
regions of the gene can be expressed to high levels without extensive protein degradation. A
summary of these expression constructs is shown in Figure 9. All were constructed by
in-frame fusions of convenient toxin A restriction fragments to either the pMALc or pET23a-c
vectors. Protein preparations from induced cultures of each of these constructs were analyzed
by both Coomassie Blue staining and Western analysis as in (b) above. In all cases, higher
levels of intact, full length fusion proteins were observed than with the larger recombinants
from section (b).

d) Purification Of Recombinant Toxin A Protein 

Large scale (500 ml) cultures of each recombinant from (c) were grown, induced, and
soluble and insoluble protein fractions were isolated. The soluble protein extracts were
affinity chromatographed to isolate recombinant fusion protein, as described [Williams et al.
(1994), supra]. In brief, extracts containing tagged pET fusions were chromatographed on a
nickel chelate column, and eluted using imidazole salts as described by the distributor
(Novagen). Extracts containing soluble pMAL fusion protein were prepared and
chromatographed in column buffer (10 mM NaPO4, 0.5 M NaCl, 10 mM β-mercaptoethanol,
pH 7.2) over an amylose resin column (New England Biolabs), and eluted with column
buffer containing 10 mM maltose as described [Williams et al. (1995), supra]. When the
expressed protein was found to be predominantly insoluble, insoluble protein extracts were
prepared by the method described in Example 17, infra. The results are summarized in Table
16. Figure 10 shows the sample purifications of recombinant toxin A protein. In this figure,
lanes 1 and 2 contain MBP fusion protein purified by affinity purification of soluble protein.
TABLE 16

Purification Of Recombinant Toxin A Protein

Clone          Protein      Yield Affinity      % Intact Soluble   Yield Intact
               Solubility   Purified Soluble    Fusion Protein     Insoluble Fusion
                            Protein                                Protein

pMA30-270      Soluble      4 mg/500 mls        10%                NA
pMA30-300      Soluble      4 mg/500 mls        5-10%              NA
pMA300-660     Insoluble    -                   NA                 10 mg/500 ml
pMA660-1100    Soluble      4.5 mg/500 mls      50%                NA
pMA1100-1610   Soluble      18 mg/500 mls       10%                NA
pMA1610-1870   Both         22 mg/500 mls       90%                20 mg/500 ml
pMA1450-1870   Insoluble    -                   NA                 0.2 mg/500 ml
pPA1100-1450   Soluble      0.1 mg/500 mls      90%                NA
pPA1100-1870   Soluble      0.02 mg/500 mls     90%                NA
pMA1870-2680   Both         12 mg/500 mls       80%                NA
pPA1870-2680   Insoluble    -                   NA                 10 mg/500 ml

pP = pET23 vector, pM = pMALc vector, A = toxin A.

Based on 1.5 OD280 = 1 mg/ml (extinction coefficient of MBP).

Estimated by Coomassie staining of SDS-PAGE gels.

Lanes 3 and 4 contain MBP fusion protein purified by solubilization of insoluble inclusion
bodies. The purified fusion protein samples are pMA1870-2680 (lane 1), pMA660-1100 (lane
2), pMA300-600 (lane 3) and pMA1450-1870 (lane 4).

Poor yields of affinity purified protein were obtained when poly-histidine tagged pET
vectors were used to drive expression (pPA1100-1450, pPA1100-1870). However, significant
protein yields were obtained from pMAL expression constructs spanning the entire toxin A
gene, and yields of full-length soluble fusion protein ranged from an estimated 200-400
µg/500 ml culture (pMA30-300) to greater than 20 mg/500 ml culture (pMA1610-1870).
Only one interval was expressed to high levels as strictly insoluble protein (pMA300-660).
Thus, although high level expression was not observed when using large expression constructs
from the toxin A gene, usable levels of recombinant protein spanning the entire toxin A gene
were obtainable by isolating induced protein from a series of smaller pMAL expression
constructs that span the entire toxin A gene. This is the first demonstration of the feasibility
of expressing recombinant toxin A protein to high levels in E. coli.
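
The full-length yields quoted above can be related to Table 16 by multiplying the affinity purified soluble yield by the estimated fraction of intact fusion protein. The following is a minimal sketch of that arithmetic for a few soluble clones, with values transcribed from Table 16; the names `TABLE_16_SOLUBLE` and `intact_yield_mg` are illustrative, and the calculation is an assumption about how the estimates relate, not a method stated in the text.

```python
# Clone -> (affinity purified soluble yield in mg per 500 ml culture,
#           (lowest, highest) estimated fraction of intact fusion protein).
# Values transcribed from Table 16; only some soluble clones are listed.
TABLE_16_SOLUBLE = {
    "pMA30-300": (4.0, (0.05, 0.10)),
    "pMA660-1100": (4.5, (0.50, 0.50)),
    "pMA1610-1870": (22.0, (0.90, 0.90)),
    "pMA1870-2680": (12.0, (0.80, 0.80)),
}


def intact_yield_mg(total_mg, intact_range):
    """Return the (low, high) estimate of intact soluble protein in mg."""
    low_fraction, high_fraction = intact_range
    return total_mg * low_fraction, total_mg * high_fraction


for clone, (total_mg, intact_range) in TABLE_16_SOLUBLE.items():
    low, high = intact_yield_mg(total_mg, intact_range)
    print(f"{clone}: {low * 1000:.0f}-{high * 1000:.0f} ug intact per 500 ml")
```

For pMA30-300 this gives the 200-400 µg range cited in the text. For pMA1610-1870 the soluble fraction alone works out to just under 20 mg, and Table 16 lists a further 20 mg/500 ml of intact insoluble protein for that clone.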
e) Hemagglutination Assay Using The Toxin A Recombinant
Proteins

The carboxy terminal end consisting of the repeating units contains the
hemagglutination activity or binding domain of C. difficile toxin A. To determine whether
the expressed toxin A recombinants retain functional activity, hemagglutination assays were
performed. Two toxin A recombinant proteins, one containing the binding domain as either
soluble affinity purified protein (pMA1870-2680) or SDS solubilized inclusion body protein
(pPA1870-2680), and soluble protein from one region outside that domain (pMA1100-1610),
were tested using a described procedure [H.C. Krivan et al., Infect. Immun., 53:573
(1986)]. Citrated rabbit red blood cells (RRBC) (Cocalico) were washed several times with
Tris-buffer (0.1 M Tris and 50 mM NaCl) by centrifugation at 450 x g for 10 minutes at
4°C. A 1% RRBC suspension was made from the packed cells and resuspended in Tris-buffer.
Dilutions of the recombinant proteins and native toxin A (Tech Labs) were made in the
Tris-buffer and added in duplicate to a round-bottomed 96-well microtiter plate in a final volume
of 100 µl. To each well, 50 µl of the 1% RRBC suspension was added, mixed by gentle
tapping, and incubated at 4°C for 3-4 hours. Significant hemagglutination occurred only in
the recombinant proteins containing the binding domain (pMA1870-2680) and native toxin
A. The recombinant protein outside the binding domain (pMA1100-1610) displayed no
hemagglutination activity. Using equivalent protein concentrations, the hemagglutination titer
for toxin A was 1:256, while titers for the soluble and insoluble recombinant proteins of the
binding domain were 1:256 and about 1:5000. Clearly, the recombinant proteins tested
retained functional activity and were able to bind RRBC's.
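
A hemagglutination titer such as 1:256 is conventionally read as the reciprocal of the highest dilution that still agglutinates the red blood cells. The sketch below shows one way such a readout could be scored. The two-fold dilution series starting at 1:2, the rule of stopping at the first negative well, and the function name `hemagglutination_titer` are all assumptions for illustration; the text does not state the dilution scheme, and the well pattern used in the example is hypothetical.

```python
def hemagglutination_titer(agglutinated, first_dilution=2, fold=2):
    """Return the reciprocal of the highest dilution that still agglutinates.

    agglutinated: booleans ordered from the least to the most dilute well.
    Reading stops at the first negative well. Returns None if the first
    well is already negative.
    """
    titer = None
    dilution = first_dilution
    for positive in agglutinated:
        if not positive:
            break
        titer = dilution
        dilution *= fold
    return titer


# HYPOTHETICAL readout: eight positive wells followed by four negative wells.
wells = [True] * 8 + [False] * 4
titer = hemagglutination_titer(wells)
print(f"titer = 1:{titer}")
```

Under these assumed settings, eight consecutive positive wells correspond to a titer of 1:256, the value reported for native toxin A and for the soluble binding domain recombinant.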

EXAMPLE 12 

Functional Activity Of IgY Reactive Against Toxin A Recombinants

The expression of recombinant toxin A protein as multiple fragments in E. coli has
demonstrated the feasibility of generating toxin A antigen through use of recombinant
methodologies (Example 11). The isolation of these recombinant proteins allows the
immunoreactivity of each individual subregion of the toxin A protein to be determined (i.e.,
in an antibody pool directed against the native toxin A protein). This identifies the regions (if
any) for which little or no antibody response is elicited when the whole protein is used as an
immunogen. Antibodies directed against specific fragments of the toxin A protein can be
purified by affinity chromatography against recombinant toxin A protein, and tested for
neutralization ability. This identifies any toxin A subregions that are essential for producing
neutralizing antibodies. Comparison with the levels of immune response directed against
these intervals when native toxin is used as an immunogen predicts whether potentially higher
titers of neutralizing antibodies can be produced by using recombinant protein directed against
an individual region, rather than the entire protein. Finally, since it is unknown whether
antibodies reactive to the recombinant toxin A proteins produced in Example 11 neutralize
toxin A as effectively as antibodies raised against native toxin A (Examples 9 and 10), the
protective ability of a pool of antibodies affinity purified against recombinant toxin A
fragments was assessed for its ability to neutralize toxin A.

This Example involved (a) epitope mapping of the toxin A protein to determine the
titre of specific antibodies directed against individual subregions of the toxin A protein when
native toxin A protein is used as an immunogen, (b) affinity purification of IgY reactive
against recombinant proteins spanning the toxin A gene, (c) toxin A neutralization assays with
affinity purified IgY reactive to recombinant toxin A protein to identify subregions of the
toxin A protein that induce the production of neutralizing antibodies, and determination of
whether complete neutralization of toxin A can be elicited with a mixture of antibodies
reactive to recombinant toxin A protein.

a) Epitope Mapping Of The Toxin A Gene

The affinity purification of recombinant toxin A protein specific to defined intervals of
the toxin A protein allows epitope mapping of antibody pools directed against native toxin A.
This has not previously been possible, since previous expression of toxin A recombinants has
been assessed only by Western blot analysis, without knowledge of the expression levels of
the protein [e.g., von Eichel-Streiber et al., J. Gen. Microbiol., 135:55-64 (1989)]. Thus, high
or low reactivity of recombinant toxin A protein on Western blots may reflect protein
expression level differences, not immunoreactivity differences. Given that the purified
recombinant proteins generated in Example 11 have been quantitated, the issue of relative
immunoreactivity of individual regions of the toxin A protein was precisely addressed.

For the purposes of this Example, the toxin A protein was subdivided into 6 intervals
(1-6), numbered from the amino (interval 1) to the carboxyl (interval 6) termini.

The recombinant proteins corresponding to these intervals were from expression clones
(see Example 11(d) for clone designations) pMA30-300 (interval 1), pMA300-660 (interval
2), pMA660-1100 (interval 3), pPA1100-1450 (interval 4), pMA1450-1870 (interval 5) and
pMA1870-2680 (interval 6). These 6 clones were selected because they span the entire
protein from amino acids numbered 30 through 2680, and subdivide the protein into 6 small
intervals. Also, the carbohydrate binding repeat interval is contained specifically in one
interval (interval 6), allowing evaluation of the immune response specifically directed against
this region. Western blots of 7.5% SDS-PAGE gels, loaded and electrophoresed with defined
quantities of each recombinant protein, were probed with either goat antitoxin A polyclonal
antibody (Tech Lab) or chicken antitoxin A polyclonal antibody [pCTA IgY, Example 8(c)].
The blots were prepared and developed with alkaline phosphatase as previously described
[Williams et al. (1995), supra]. At least 90% of all reactivity, in either goat or chicken
antibody pools, was found to be directed against the ligand binding domain (interval 6). The
remaining immunoreactivity was directed against all five remaining intervals, and was similar
in both antibody pools, except that the chicken antibody showed a much lower reactivity
against interval 2 than the goat antibody.

This clearly demonstrates that when native toxin A is used as an immunogen in goats
or chickens, the bulk of the immune response is directed against the ligand binding domain of
the protein, with the remaining response distributed throughout the remaining 2/3 of the
protein.
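
The contrast drawn above, that at least 90% of the reactivity targets a domain making up only part of the mapped protein, can be checked against the clone designations. The sketch below is an illustration only: it assumes that the numbers in each clone name are the amino acid endpoints of the interval, as the statement that the clones span amino acids 30 through 2680 suggests. The names `INTERVALS` and `length_fractions` are illustrative.

```python
# Interval number -> (start, end) amino acid positions, read from the clone
# designations pMA30-300, pMA300-660, pMA660-1100, pPA1100-1450,
# pMA1450-1870 and pMA1870-2680.
# ASSUMPTION: the numbers in each clone name are residue endpoints.
INTERVALS = {
    1: (30, 300),
    2: (300, 660),
    3: (660, 1100),
    4: (1100, 1450),
    5: (1450, 1870),
    6: (1870, 2680),
}


def length_fractions(intervals):
    """Return each interval's share of the total mapped length."""
    total = sum(end - start for start, end in intervals.values())
    return {
        number: (end - start) / total
        for number, (start, end) in intervals.items()
    }


fractions_by_interval = length_fractions(INTERVALS)
for number, fraction in fractions_by_interval.items():
    print(f"interval {number}: {fraction:.1%} of mapped length")
```

On this reading, interval 6 covers roughly 31% of the mapped region while attracting at least 90% of the antibody reactivity, and intervals 1-5 together account for the remaining part of the protein, roughly two thirds.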

b) Affinity Purification Of IgY Reactive Against Recombinant
Toxin A Protein

Affinity columns, containing recombinant toxin A protein from the 6 defined intervals
in (a) above, were made and used to (i) affinity purify antibodies reactive to each individual
interval from the CTA IgY preparation [Example 8(c)], and (ii) deplete interval specific
antibodies from the CTA IgY preparation. Affinity columns were made by coupling 1 ml of
PBS-washed Actigel resin (Sterogene) with region specific protein and 1/10 final volume of
Ald-coupling solution (1 M sodium cyanoborohydride). The total region specific protein
added to each reaction mixture was 2.7 mg (interval 1), 3 mg (intervals 2 and 3), 0.1 mg
(interval 4), 0.2 mg (interval 5) and 4 mg (interval 6). Protein for intervals 1, 3, and 6 was
affinity purified pMAL fusion protein in column buffer (see Example 11). Interval 4 was
affinity purified poly-histidine containing pET fusion in PBS; intervals 2 and 5 were from
inclusion body preparations of insoluble pMAL fusion protein, dialyzed extensively in PBS.
Aliquots of the supernatants from the coupling reactions, before and after coupling, were
assessed by Coomassie staining of 7.5% SDS-PAGE gels. Based on protein band intensities,
in all cases greater than 50% coupling efficiencies were estimated. The resins were poured
into 5 ml BioRad columns, washed extensively with PBS, and stored at 4°C.
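
Since the coupling efficiency was estimated to exceed 50% in every case, the protein amounts listed above imply a lower bound on the ligand bound to each 1 ml column. The following minimal sketch makes that bound explicit; the names `PROTEIN_ADDED_MG` and `minimum_coupled_mg` are illustrative, and the figures are lower bounds derived from the text rather than measured values.

```python
# Interval number -> region specific protein added to the coupling reaction (mg),
# as listed in the text.
PROTEIN_ADDED_MG = {1: 2.7, 2: 3.0, 3: 3.0, 4: 0.1, 5: 0.2, 6: 4.0}

# The text reports "greater than 50%" coupling in all cases, so 0.5 is a floor.
MIN_COUPLING_EFFICIENCY = 0.5


def minimum_coupled_mg(added_mg, efficiency=MIN_COUPLING_EFFICIENCY):
    """Return the lower bound on coupled protein (mg) for each interval."""
    return {interval: mg * efficiency for interval, mg in added_mg.items()}


min_coupled = minimum_coupled_mg(PROTEIN_ADDED_MG)
for interval, mg in min_coupled.items():
    print(f"interval {interval}: more than {mg:.2f} mg coupled per 1 ml resin")
```

The interval 4 and interval 5 columns carry far less ligand than the others, which reflects the low amounts of purified protein available for those two intervals.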

Aliquots of the CTA IgY polyclonal antibody preparation were depleted for each
individual region as described below. A 20 ml sample of the CTA IgY preparation [Example
8(c)] was dialyzed extensively against 3 changes of PBS (1 liter for each dialysis), quantitated
by absorbance at OD280, and stored at 4°C. Six 1 ml aliquots of the dialyzed IgY preparation
were removed, and depleted individually for each of the six intervals. Each 1 ml aliquot was
passed over the appropriate affinity column, and the eluate twice reapplied to the column.
The eluate was collected, and pooled with a 1 ml PBS wash. Bound antibody was eluted
from the column by washing with 5 column volumes of 4 M Guanidine-HCl (in 10 mM
Tris-HCl, pH 8.0). The column was reequilibrated in PBS, and the depleted antibody stock
reapplied as described above. The eluate was collected, pooled with a 1 ml PBS wash,
quantitated by absorbance at OD280, and stored at 4°C. In this manner, 6 aliquots of the CTA
IgY preparation were individually depleted for each of the 6 toxin A intervals, by two rounds
of affinity depletion. The specificity of each depleted stock was tested by Western blot
analysis. Multiple 7.5% SDS-PAGE gels were loaded with protein samples corresponding to
all 6 toxin A subregions. After electrophoresis, the gels were blotted, and protein transfer
confirmed by Ponceau S staining [protocols described in Williams et al. (1995), supra]. After
blocking the blots 1 hr at 20°C in PBS + 0.1% Tween 20 (PBST) containing 5% milk (as a
blocking buffer), 4 ml of either a 1/500 dilution of the dialyzed CTA IgY preparation in
blocking buffer, or an equivalent amount of the six depleted antibody stocks (using OD280 to
standardize antibody concentration) were added and the blots incubated a further 1 hr at room
temperature. The blots were washed and developed with alkaline phosphatase (using a rabbit
anti-chicken alkaline phosphatase conjugate as a secondary antibody) as previously described
[Williams et al. (1995), supra]. In all cases, only the target interval was depleted for antibody
reactivity, and at least 90% of the reactivity to the target intervals was specifically depleted.

Region specific antibody pools were isolated by affinity chromatography as described
below. Ten mls of the dialyzed CTA IgY preparation were applied sequentially to each
affinity column, such that a single 10 ml aliquot was used to isolate region specific antibodies
specific to each of the six subregions. The columns were sequentially washed with 10
volumes of PBS, 6 volumes of BBS-Tween, 10 volumes of TBS, and eluted with 4 ml
Actisep elution media (Sterogene). The eluate was dialyzed extensively against several
changes of PBS, and the affinity purified antibody collected and stored at 4°C. The volumes
of the eluate increased to greater than 10 mls during dialysis in each case, due to the high
viscosity of the Actisep elution media. Aliquots of each sample were 20x concentrated using
Centricon 30 microconcentrators (Amicon) and stored at 4°C. The specificity of each region
specific antibody pool was tested, relative to the dialyzed CTA IgY preparation, by Western
blot analysis, exactly as described above, except that 4 ml samples of blocking buffer
containing 100 µl region specific antibody (unconcentrated) were used instead of the depleted
CTA IgY preparations. Each affinity purified antibody preparation was specific to the defined
interval, except that samples purified against intervals 1-5 also reacted with interval 6. This
may be due to non-specific binding to the interval 6 protein, since this protein contains the
repetitive ligand binding domain which has been shown to bind antibodies nonspecifically
[Lyerly et al., Curr. Microbiol., 19:303-306 (1989)].

The reactivity of each affinity purified antibody preparation to the corresponding
proteins was approximately the same as the reactivity of the 1/500 diluted dialyzed CTA IgY
preparation standard. Given that the specific antibody stocks were diluted 1/40, this would
indicate that the unconcentrated affinity purified antibody stocks contain 1/10-1/20 the
concentration of specific antibodies relative to the starting CTA IgY preparation.
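
The 1/10-1/20 estimate follows from comparing the two working dilutions that gave roughly equal Western blot signal. The sketch below works through that arithmetic, under the assumption that equal signal means equal concentration of specific antibody in the two probing solutions; the variable names are illustrative.

```python
from fractions import Fraction

# 100 ul of unconcentrated affinity purified antibody in a 4 ml sample
# of blocking buffer, as described in the text.
antibody_ul = 100
sample_ml = 4
purified_dilution = Fraction(antibody_ul, sample_ml * 1000)

# The dialyzed CTA IgY standard was used at a 1/500 dilution.
standard_dilution = Fraction(1, 500)

# ASSUMPTION: equal blot signal means equal specific antibody concentration
# in the two probing solutions, so
#   c_purified * purified_dilution = c_starting * standard_dilution
relative_concentration = standard_dilution / purified_dilution

print(f"purified stock dilution: {purified_dilution}")
print(f"purified / starting concentration: {relative_concentration}"
      f" (about 1/{float(1 / relative_concentration):.1f})")
```

The ratio works out to 2/25, about 1/12.5, which falls inside the 1/10-1/20 range given in the text.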

c) Toxin A Neutralization Assay Using Antibodies Reactive
Toward Recombinant Toxin A Protein

The CHO toxin neutralization assay [Example 8(d)] was used to assess the ability of
the depleted or enriched samples generated in (b) above to neutralize the cytotoxicity of toxin
A. The general ability of affinity purified antibodies to neutralize toxin A was assessed by
mixing together aliquots of all 6 concentrated stocks of the 6 affinity purified samples
generated in (b) above, and testing the ability of this mixture to neutralize a toxin A
concentration of 0.1 µg/ml. The results, shown in Figure 11, demonstrate almost complete
neutralization of toxin A using the affinity purified (AP) mix. Some epitopes within the
recombinant proteins utilized for affinity purification were probably lost when the proteins
were denatured before affinity purification [by Guanidine-HCl treatment in (b) above]. Thus,
the neutralization ability of antibodies directed against recombinant protein is probably
underestimated using these affinity purified antibody pools. This experiment demonstrates
that antibodies reactive to recombinant toxin A can neutralize cytotoxicity, suggesting that
neutralizing antibodies may be generated by using recombinant toxin A protein as
immunogen.

In view of the observation that the recombinant expression clones of the toxin A gene
divide the protein into 6 subregions, the neutralizing ability of antibodies directed against each
individual region was assessed. The neutralizing ability of antibodies directed against the
ligand binding domain of toxin A was determined first.

In the toxin neutralization experiment shown in Figure 11, interval 6 specific
antibodies (interval 6 contains the ligand binding domain) were depleted from the dialyzed
PEG preparation, and the effect on toxin neutralization assayed. Interval 6 antibodies were
depleted either by utilizing the interval 6 depleted CTA IgY preparation from (b) above ("-6
aff. depleted" in Figure 11), or by addition of interval 6 protein to the CTA IgY preparation
(estimated to be a 10 fold molar excess over anti-interval 6 immunoglobulin present in this
preparation) to compete for the interval 6 specific antibodies ("-6 prot depleted" in Figure 11).
In both instances, removal of interval 6 specific antibodies reduces the neutralization
efficiency relative to the starting CTA IgY preparation. This demonstrates that antibodies
directed against interval 6 contribute to toxin neutralization. Since interval 6 corresponds to
the ligand binding domain of the protein, these results demonstrate that antibodies directed
against this region in the PEG preparation contribute to the neutralization of toxin A in this
assay. However, it is significant that after removal of these antibodies, the PEG preparation
retains significant ability to neutralize toxin A (Figure 11). This neutralization is probably
due to the action of antibodies specific to other regions of the toxin A protein, since at least
90% of the ligand binding region reactive antibodies were removed in the depleted sample
prepared in (b) above. This conclusion was supported by comparison of the toxin
neutralization of the affinity purified (AP) mix with that of affinity purified interval 6
antibody alone. Although some neutralization ability was observed with AP interval 6
antibodies alone, the neutralization was significantly less than that observed with the mixture
of all 6 AP antibody stocks (not shown).

Given that the mix of all six affinity purified samples almost completely neutralized
the cytotoxicity of toxin A (Figure 11), the relative importance of antibodies directed against
toxin A intervals 1-5 within the mixture was determined. This was assessed in two ways.
First, samples containing affinity purified antibodies representing 5 of the 6 intervals were
prepared, such that each individual region was depleted from one sample. Figure 12
demonstrates a sample neutralization curve, comparing the neutralization ability of affinity
purified antibody mixes without interval 4 (-4) or 5 (-5) specific antibodies, relative to the
mix of all 6 affinity purified antibody stocks (positive control). While the removal of interval
5 specific antibodies had no effect on toxin neutralization (or intervals 1-3, not shown), the
loss of interval 4 specific antibodies significantly reduced toxin neutralization (Figure 12).

Similar results were seen in a second experiment, in which affinity purified antibodies,
directed against a single region, were added to interval 6 specific antibodies, and the effects
on toxin neutralization assessed. Only interval 4 specific antibodies significantly enhanced
neutralization when added to interval 6 specific antibodies (Figure 13). These results
demonstrate that antibodies directed against interval 4 (corresponding to clone pPA1100-1450
in Figure 9) are important for neutralization of cytotoxicity in this assay. Epitope mapping
has shown that only low levels of antibodies reactive to this region are generated when native
toxin A is used as an immunogen [Example 12(a)]. It is hypothesized that immunization with
recombinant protein specific to this interval will elicit higher titers of neutralizing antibodies.
In summary, this analysis has identified two critical regions of the toxin A protein against
which neutralizing antibodies are produced, as assayed by the CHO neutralization assay.



EXAMPLE 13

Production And Evaluation Of Avian Antitoxin
Against C. difficile Recombinant Toxin A Polypeptide

In Example 12, we demonstrated neutralization of toxin A mediated cytotoxicity by
affinity purified antibodies reactive to recombinant toxin A protein. To determine whether
antibodies raised against a recombinant polypeptide fragment of C. difficile toxin A may be
effective in treating clostridial diseases, antibodies to recombinant toxin A protein representing
the binding domain were generated. Two toxin A binding domain recombinant polypeptides,
expressing the binding domain in either the pMALc (pMA1870-2680) or pET 23
(pPA1870-2680) vector, were used as immunogens. The pMAL protein was affinity purified as a
soluble product [Example 12(d)] and the pET protein was isolated as insoluble inclusion bodies
[Example 12(d)] and solubilized to an immunologically active protein using a proprietary
method described in a pending patent application (U.S. Patent Application Serial No.
08/129,027). This Example involves (a) immunization, (b) antitoxin collection, (c)
determination of antitoxin antibody titer, (d) anti-recombinant toxin A neutralization of toxin
A hemagglutination activity in vitro, and (e) assay of in vitro toxin A neutralizing activity.
a) Immunization

The soluble and the inclusion body preparations each were used separately to
immunize hens. Both purified toxin A polypeptides were diluted in PBS and emulsified with
approximately equal volumes of CFA for the initial immunization or IFA for subsequent
booster immunizations. On day zero, for each of the recombinant preparations, two egg
laying white Leghorn hens (obtained from local breeder) were each injected at multiple sites
(intramuscular and subcutaneous) with 1 ml of recombinant adjuvant mixture containing
approximately 0.5 to 1.5 mgs of recombinant toxin A. Booster immunizations of 1.0 mg
were given on days 14 and day 28.

b) Antitoxin Collection 

Total yolk immune IgY was extracted as described in the standard PEG protocol (as in
Example 1) and the final IgY pellet was dissolved in sterile PBS at the original yolk volume.
This material is designated "immune recombinant IgY" or "immune IgY."

c) Antitoxin Antibody Titer 

To determine if the recombinant toxin A protein was sufficiently immunogenic to raise
antibodies in hens, the antibody titer of a recombinant toxin A polypeptide was determined by
ELISA. Eggs from both hens were collected on day 32, the yolks pooled and the antibody
was isolated using PEG as described. The immune recombinant IgY antibody titer was
determined for the soluble recombinant protein containing the maltose binding protein fusion
generated in p-Mal (pMA1870-2680). Ninety-six well Falcon Pro-bind plates were coated
overnight at 4°C with 100 µl/well of toxin A recombinant at 2.5 µg/µl in PBS containing
0.05% thimerosal. Another plate was also coated with maltose binding protein (MBP) at the
same concentration, to permit comparison of antibody reactivity to the fusion partner. The
next day, the wells were blocked with PBS containing 1% bovine serum albumin (BSA) for 1
hour at 37°C. IgY isolated from immune or preimmune eggs was diluted in antibody diluent
(PBS containing 1% BSA and 0.05% Tween-20), and added to the blocked wells and
incubated for 1 hour at 37°C. The plates were washed three times with PBS with 0.05%
Tween-20, then three times with PBS. Alkaline phosphatase conjugated rabbit anti-chicken
IgG (Sigma) diluted 1:1000 in antibody diluent was added to the plate, and incubated for 1
hour at 37°C. The plates were washed as before and substrate was added [p-nitrophenyl
phosphate (Sigma)] at 1 mg/ml in 0.05 M Na2CO3, pH 9.5 and 10 mM MgCl2. The plates
were evaluated quantitatively on a Dynatech MR 300 Micro EPA plate reader at 410 nm
about 10 minutes after the addition of substrate.

Based on these ELISA results, high antibody titers were raised in chickens immunized
with the toxin A recombinant polypeptide. The recombinant appeared to be highly
immunogenic, as it was able to generate high antibody titers relatively quickly with few
immunizations. Immune IgY titer directed specifically to the toxin A portion of the
recombinant was higher than the immune IgY titer to its fusion partner, the maltose binding
protein, and significantly higher than the preimmune IgY. ELISA titers (reciprocal of the
highest dilution of IgY generating a signal) in the preimmune IgY to the MBP or the
recombinant were <1:30 while the immune IgY titers to MBP and the toxin A recombinant
were 1:18750 and >1:93750 respectively. Importantly, the anti-recombinant antibody titers
generated in the hens against the recombinant polypeptide are much higher, compared to
antibodies to that region raised using native toxin A. The recombinant antibody titer to
region 1870-2680 in the CTA antibody preparation is at least five-fold lower compared to the
recombinant generated antibodies (1:18750 versus 1:93750). Thus, it appears a better
immune response can be generated against a specific recombinant using that recombinant as
the immunogen compared to the native toxin A.
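
The titer comparison above reduces to arithmetic on the reciprocal dilutions. The following Python sketch is illustrative only: the function names are hypothetical, and the five-fold dilution series starting at 1:30 is an assumption inferred from the reported values (1:30, 1:18750, 1:93750), not a protocol stated in the text.

```python
def titer_reciprocal(titer: str) -> int:
    """Return N for a titer written as '1:N', ignoring a leading '<' or '>'."""
    cleaned = titer.strip().lstrip("<>").strip()
    numerator, denominator = cleaned.split(":")
    if int(numerator) != 1:
        raise ValueError(f"expected a titer of the form 1:N, got {titer!r}")
    return int(denominator)


def fold_difference(higher: str, lower: str) -> float:
    """Ratio of two titers, e.g. recombinant-raised versus native-toxin-raised IgY."""
    return titer_reciprocal(higher) / titer_reciprocal(lower)


# Assumed (hypothetical) five-fold serial dilutions starting at 1:30.
dilution_series = [30 * 5 ** step for step in range(6)]

if __name__ == "__main__":
    print(dilution_series)
    print(fold_difference(">1:93750", "1:18750"))
```

Because the recombinant-raised titer is reported only as a lower bound (>1:93750), the computed ratio is itself a lower bound, which matches the "at least five-fold" wording.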

This observation is significant, as it shows that because recombinant portions stimulate
the production of antibodies, it is not necessary to use native toxin molecules to produce
antitoxin preparations. Thus, the problems associated with the toxicity of the native toxin are
avoided and large-scale antitoxin production is facilitated.

d) Anti-Recombinant Toxin A Neutralization Of Toxin A
Hemagglutination Activity In Vitro

Toxin A has hemagglutinating activity besides cytotoxic and enterotoxin properties.
Specifically, toxin A agglutinates rabbit erythrocytes by binding to a trisaccharide (gal 1-3B1-
4GlcNAc) on the cell surface. [H. Krivan et al., Infect. Immun., 53:573-581 (1986).] We
examined whether the anti-recombinant toxin A (immune IgY, antibodies raised against the
insoluble product expressed in pET) can neutralize the hemagglutination activity of toxin A in
vitro. The hemagglutination assay procedure used was described by H.C. Krivan et al.
Polyethylene glycol-fractionated immune or preimmune IgY were pre-absorbed with citrated
rabbit erythrocytes prior to performing the hemagglutination assay because we have found that
IgY alone can agglutinate red blood cells. Citrated rabbit red blood cells (RRBCs) (Cocalico)
were washed twice by centrifugation at 450 x g with isotonic buffer (0.1 M Tris-HCl, 0.05 M
NaCl, pH 7.2). RRBC-reactive antibodies in the IgY were removed by preparing a 10%
RRBC suspension (made by adding packed cells to immune or preimmune IgY) and
incubating the mixture for 1 hour at 37°C. The RRBCs were then removed by centrifugation.
Neutralization of the hemagglutination activity of toxin A by antibody was tested in round-
bottomed 96-well microtiter plates. Twenty-five µl of toxin A (36 µg/ml) (Tech Lab) in
isotonic buffer was mixed with an equal volume of different dilutions of immune or
preimmune IgY in isotonic buffer, and incubated for 15 minutes at room temperature. Then,
50 µl of a 1% RRBC suspension in isotonic buffer was added and the mixture was incubated
for 3 hours at 4°C. Positive control wells containing the final concentration of 9 µg/ml of
toxin A after dilution without IgY were also included. Hemagglutination activity was
assessed visually, with a diffuse matrix of RRBCs coating the bottom of the well
representing a positive hemagglutination reaction and a tight button of RRBCs at the bottom
of the well representing a negative reaction. The anti-recombinant immune IgY neutralized
toxin A hemagglutination activity, giving a neutralization titer of 1:8. However, preimmune
IgY was unable to neutralize the hemagglutination ability of toxin A.
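
The final toxin A concentration in each well follows from the volumes given above (25 µl of toxin at 36 µg/ml, 25 µl of IgY dilution or buffer, 50 µl of RRBC suspension). A minimal Python sketch of that dilution arithmetic, with a hypothetical function name and assuming the volumes are simply additive:

```python
def final_concentration(stock_ug_per_ml: float,
                        stock_volume_ul: float,
                        added_volumes_ul: list[float]) -> float:
    """Concentration after a stock aliquot is diluted by the other additions.

    Assumes volumes are additive; units of the result match the stock (µg/ml).
    """
    total_volume_ul = stock_volume_ul + sum(added_volumes_ul)
    return stock_ug_per_ml * stock_volume_ul / total_volume_ul


if __name__ == "__main__":
    # 25 µl toxin A at 36 µg/ml + 25 µl IgY dilution (or buffer) + 50 µl RRBC suspension
    print(final_concentration(36.0, 25.0, [25.0, 50.0]))
```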

e) Assay Of In Vitro Toxin A Neutralizing Activity

The ability of the anti-recombinant toxin A IgY (immune IgY antibodies raised against
pMA1870-2680, the soluble recombinant binding domain protein expressed in pMAL,
designated as Anti-tox. A-2 in Figure 14, and referred to as recombinant region 6) and pre-
immune IgY, prepared as described in Example 8(c) above, to neutralize the cytotoxic activity
of toxin A was assessed in vitro using the CHO cell cytotoxicity assay, and toxin A (Tech
Lab) at a concentration of 0.1 µg/ml, as described in Example 8(d) above. As additional
controls, the anti-native toxin A IgY (CTA) and pre-immune IgY preparations described in
Example 8(c) above were also tested. The results are shown in Figure 14.

The anti-recombinant toxin A IgY demonstrated only partial neutralization of the
cytotoxic activity of toxin A, while the pre-immune IgY did not demonstrate any significant
neutralizing activity.

EXAMPLE 14

In vivo Neutralization Of C. difficile Toxin A

The ability of avian antibodies (IgY) raised against recombinant toxin A binding
domain to neutralize the enterotoxin activity of C. difficile toxin A was evaluated in vivo
using Golden Syrian hamsters. The Example involved: (a) preparation of the avian anti-
recombinant toxin A IgY for oral administration; (b) in vivo protection of hamsters from C.
difficile toxin A enterotoxicity by treatment of toxin A with avian anti-recombinant toxin A
IgY; and (c) histologic evaluation of hamster ceca.

a) Preparation Of The Avian Anti-Recombinant Toxin A IgY
For Oral Administration

Eggs were collected from hens which had been immunized with the recombinant C.
difficile toxin A fragment pMA1870-2680 (described in Example 13, above). A second group
of eggs purchased at a local supermarket was used as a pre-immune (negative) control. Egg
yolk immunoglobulin (IgY) was extracted by PEG from the two groups of eggs as described
in Example 8(c), and the final IgY pellets were solubilized in one-fourth the original yolk
volume using 0.1 M carbonate buffer (mixture of NaHCO3 and Na2CO3), pH 9.5. The basic
carbonate buffer was used in order to protect the toxin A from the acidic pH of the stomach
environment.

b) In vivo Protection Of Hamsters Against C. difficile Toxin A
Enterotoxicity By Treatment Of Toxin A With Avian Anti-
recombinant Toxin A IgY

In order to assess the ability of the avian anti-recombinant toxin A IgY, prepared in
section (a) above, to neutralize the in vivo enterotoxin activity of toxin A, an in vivo toxin
neutralization model was developed using Golden Syrian hamsters. This model was based on
published values for the minimum amount of toxin A required to elicit diarrhea (0.08 mg
toxin A/Kg body wt.) and death (0.16 mg toxin A/Kg body wt.) in hamsters when
administered orally (Lyerly et al., Infect. Immun., 47:349-352 (1985)).

For the study, four separate experimental groups were used, with each group consisting
of 7 female Golden Syrian hamsters (Charles River), approx. three and one-half weeks old,
weighing approx. 50 gms each. The animals were housed as groups of 3 and 4, and were
offered food and water ad libitum through the entire length of the study.

For each animal, a mixture containing either 10 µg of toxin A (0.2 mg/Kg) or 30 µg of
toxin A (0.6 mg/Kg) (C. difficile toxin A was obtained from Tech Lab) and 1 ml of either the
anti-recombinant toxin A IgY or pre-immune IgY (from section (a) above) was prepared.
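
The per-kilogram doses quoted above can be checked against the published thresholds for diarrhea and death. The sketch below is illustrative: the function name is hypothetical, and the body weight of about 50 g per animal is the value implied by the stated pairs (10 µg = 0.2 mg/Kg, 30 µg = 0.6 mg/Kg).

```python
# Published oral thresholds quoted in the text (mg toxin A per Kg body weight).
DIARRHEA_THRESHOLD_MG_PER_KG = 0.08
LETHAL_THRESHOLD_MG_PER_KG = 0.16


def dose_mg_per_kg(toxin_ug: float, body_weight_g: float) -> float:
    """µg of toxin per g of body weight is numerically equal to mg per Kg."""
    return toxin_ug / body_weight_g


if __name__ == "__main__":
    for toxin_ug in (10.0, 30.0):
        dose = dose_mg_per_kg(toxin_ug, body_weight_g=50.0)  # assumed 50 g animal
        print(toxin_ug, dose, dose > LETHAL_THRESHOLD_MG_PER_KG)
```

Under that assumption both challenge doses exceed the quoted minimum lethal dose of 0.16 mg/Kg.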

These mixtures were incubated at 37°C for 60 min. and were then administered to the animals
by the oral route. The animals were then observed for the onset of diarrhea and death for a
period of 24 hrs. following the administration of the toxin A+IgY mixtures, at the end of
which time, the following results were tabulated and shown in Table 17:

TABLE 17

Study Outcome At 24 Hours

Experimental Group                              Healthy¹    Diarrhea²    Dead³

10 µg Toxin A + Antitoxin Against Interval 6       7            0          0
30 µg Toxin A + Antitoxin Against Interval 6       7            0          0
10 µg Toxin A + Pre-Immune Serum                   0            5          2
30 µg Toxin A + Pre-Immune                         0            5          2

¹ Animals remained healthy through the entire 24 hour study period.
² Animals developed diarrhea, but did not die.
³ Animals developed diarrhea, and subsequently died.

Pretreatment of toxin A at both doses tested, using the anti-recombinant toxin A IgY,
prevented all overt symptoms of disease in hamsters. Therefore, pretreatment of C. difficile
toxin A, using the anti-recombinant toxin A IgY, neutralized the in vivo enterotoxin activity
of the toxin A. In contrast, all animals from the two groups which received toxin A which
had been pretreated using pre-immune IgY developed disease symptoms which ranged from
diarrhea to death. The diarrhea which developed in the 5 animals which did not die in each
of the two pre-immune groups, spontaneously resolved by the end of the 24 hr. study period.

c) Histologic Evaluation Of Hamster Ceca

In order to further assess the ability of anti-recombinant toxin A IgY to protect
hamsters from the enterotoxin activity of toxin A, histologic evaluations were performed on
the ceca of hamsters from the study described in section (b) above.

Three groups of animals were sacrificed in order to prepare histological specimens.
The first group consisted of a single representative animal taken from each of the 4 groups of
surviving hamsters at the conclusion of the study described in section (b) above. These
animals represented the 24 hr. timepoint of the study.

The second group consisted of two animals which were not part of the study described
above, but were separately treated with the same toxin A + pre-immune IgY mixtures as
described for the animals in section (b) above. Both of these hamsters developed diarrhea,
and were sacrificed 8 hrs. after the time of administration of the toxin A + pre-immune IgY
mixtures. At the time of sacrifice, both animals were presenting symptoms of diarrhea.
These animals represented the acute phase of the study.

The final group consisted of a single untreated hamster from the same shipment of
animals as those used for the two previous groups. This animal served as the normal control.

Samples of cecal tissue were removed from the 7 animals described above, and were
fixed overnight at 4°C using 10% buffered formalin. The fixed tissues were paraffin-
embedded, sectioned, and mounted on glass microscope slides. The tissue sections were then
stained using hematoxylin and eosin (H and E stain), and were examined by light microscopy.

The tissues obtained from the two 24 hr. animals which received mixtures containing
either 10 µg or 30 µg of toxin A and anti-recombinant toxin A IgY were indistinguishable from
the normal control, both in terms of gross pathology, as well as at the microscopic level.
These observations provide further evidence for the ability of anti-recombinant toxin A IgY to
effectively neutralize the in vivo enterotoxin activity of C. difficile toxin A, and thus its ability
to prevent acute or lasting toxin A-induced pathology.

In contrast, the tissues from the two 24 hr. animals which received the toxin A + pre-
immune IgY mixtures demonstrated significant pathology. In both of these groups, the
mucosal layer was observed to be less organized than in the normal control tissue. The
cytoplasm of the epithelial cells had a vacuolated appearance, and gaps were present between
the epithelium and the underlying cell layers. The lamina propria was largely absent.
Intestinal villi and crypts were significantly diminished, and appeared to have been overgrown
by a planar layer of epithelial cells and fibroblasts. Therefore, although these animals overtly
appeared to recover from the acute symptoms of toxin A intoxication, lasting pathologic
alterations to the cecal mucosa had occurred.

The tissues obtained from the two acute animals which received mixtures of toxin A
and pre-immune IgY demonstrated the most significant pathology. At the gross pathological
level, both animals were observed to have severely distended ceca which were filled with
watery, diarrhea-like material. At the microscopic level, the animal that was given the
mixture containing 10 µg of toxin A and pre-immune IgY was found to have a mucosal layer
which had a ragged, damaged appearance, and a disorganized, compacted quality. The crypts
were largely absent, and numerous breaks in the epithelium had occurred. There was also an
influx of erythrocytes into spaces between the epithelial layer and the underlying tissue. The
animal which had received the mixture containing 30 µg of toxin A and pre-immune IgY
demonstrated the most severe pathology. The cecal tissue of this animal had an appearance
very similar to that observed in animals which had died from C. difficile disease. Widespread
destruction of the mucosa was noted, and the epithelial layer had sloughed. Hemorrhagic
areas containing large numbers of erythrocytes were very prevalent. All semblance of normal
tissue architecture was absent from this specimen. In terms of the presentation of pathologic
events, this in vivo hamster model of toxin A-intoxication correlates very closely with the
pathologic consequences of C. difficile disease in hamsters. The results presented in this
Example demonstrate that while anti-recombinant toxin A (Interval 6) IgY is capable of only
partially neutralizing the cytotoxic activity of C. difficile toxin A, the same antibody
effectively neutralizes 100% of the in vivo enterotoxin activity of the toxin. While it is not
intended that this invention be limited to this mechanism, this may be due to the cytotoxicity
and enterotoxicity of C. difficile Toxin A as two separate and distinct biological functions.

EXAMPLE 15 

In Vivo Neutralization Of C. difficile Toxin A By
Antibodies Against Recombinant Toxin A Polypeptides

The ability of avian antibodies directed against the recombinant C. difficile toxin A
fragment 1870-2680 (as expressed by pMA1870-2680; see Example 13) to neutralize the
enterotoxic activity of toxin A was demonstrated in Example 14. The ability of avian
antibodies (IgYs) directed against other recombinant toxin A epitopes to neutralize native
toxin A in vivo was next evaluated. This example involved: (a) the preparation of IgYs
against recombinant toxin A polypeptides; (b) in vivo protection of hamsters against toxin A
by treatment with anti-recombinant toxin A IgYs and (c) quantification of specific antibody
concentration in CTA and Interval 6 IgY PEG preparations.

The nucleotide sequence of the coding region of the entire toxin A protein is listed in
SEQ ID NO:5. The amino acid sequence of the entire toxin A protein is listed in SEQ ID
NO:6. The amino acid sequence consisting of amino acid residues 1870 through 2680 of
toxin A is listed in SEQ ID NO:7. The amino acid sequence consisting of amino acid
residues 1870 through 1960 of toxin A is listed in SEQ ID NO:8.

a) Preparation Of IgYs Against Recombinant Toxin A
Polypeptides

Eggs were collected from Leghorn hens which have been immunized with recombinant
C. difficile toxin A polypeptide fragments encompassing the entire toxin A protein. The
polypeptide fragments used as immunogens were: 1) pMA 1870-2680 (Interval 6), 2) pPA
1100-1450 (Interval 4), and 3) a mixture of fragments consisting of pMA 30-300 (Interval 1),
pMA 300-660 (Interval 2), pMA 660-1100 (Interval 3) and pMA 1450-1870 (Interval 5).
This mixture of immunogens is referred to as Interval 1235. The location of each interval
within the toxin A molecule is shown in Figure 15A. In Figure 15A, the following
abbreviations are used: pP refers to the pET23 vector (New England BioLabs); pM refers to
the pMAL™-c vector (New England BioLabs); A refers to toxin A; the numbers refer to the
amino acid interval expressed in the clone. (For example, the designation pMA30-300
indicates that the recombinant clone encodes amino acids 30-300 of toxin A and the vector
used was pMAL™-c.)
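
The clone naming convention described above (vector code, toxin letter, amino acid interval) is regular enough to parse mechanically. The following Python sketch is a hypothetical helper, not part of the described methods; it assumes the vector codes given in the figure legends of this Example (pP, pM, and, from the Figure 15B legend, pG and pB) and ignores suffixes such as "(H)".

```python
import re
from typing import NamedTuple

# Vector codes as defined in the Figure 15A and 15B legends.
VECTOR_CODES = {"P": "pET23", "M": "pMAL-c", "G": "pGEX", "B": "PinPoint Xa"}


class Clone(NamedTuple):
    vector: str
    toxin: str
    first_residue: int
    last_residue: int


_DESIGNATION = re.compile(r"^p([PMGB])([A-Z])\s*(\d+)-(\d+)$")


def parse_clone(designation: str) -> Clone:
    """Split a designation such as 'pMA30-300' into vector, toxin and interval."""
    match = _DESIGNATION.match(designation.strip())
    if match is None:
        raise ValueError(f"unrecognized clone designation: {designation!r}")
    code, toxin, first, last = match.groups()
    return Clone(VECTOR_CODES[code], toxin, int(first), int(last))


if __name__ == "__main__":
    print(parse_clone("pMA30-300"))
    print(parse_clone("pPA 1100-1450"))
```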

The recombinant proteins were generated as described in Example 11. The IgYs were
extracted and solubilized in 0.1 M carbonate buffer pH 9.5 for oral administration as described
in Example 14(a). The IgY reactivities against each individual recombinant interval were
evaluated by ELISA as described in Example 13(c).

b) In Vivo Protection Of Hamsters Against Toxin
A By Treatment With Anti-Recombinant Toxin A Antibodies

The ability of antibodies raised against recombinant toxin A polypeptides to provide in
vivo protection against the enterotoxic activity of toxin A was examined in the hamster model
system. This assay was performed as described in Example 14(b). Briefly, for each 40-50
gram female Golden Syrian hamster (Charles River), 1 ml of IgY 4X (i.e., resuspended in 1/4
of the original yolk volume) PEG prep against Interval 6, Interval 4 or Interval 1235 was
mixed with 30 µg (LD100 oral dose) of C. difficile toxin A (Tech Lab). Preimmune IgY
mixed with toxin A served as a negative control. Antibodies raised against C. difficile toxoid
A (Example 8) mixed with toxin A (CTA) served as a positive control. The mixture was
incubated for 1 hour at 37°C then orally administered to lightly etherized hamsters using an
18G feeding needle. The animals were then observed for the onset of diarrhea and death for
a period of approximately 24 hours. The results are shown in Table 18.



TABLE 18

Study Outcome After 24 Hours

Treatment Group      Healthy¹    Diarrhea²    Dead³

Preimmune               0            0          7
CTA                     5            0          0
Interval 6              6            1          0
Interval 4              0            1          6
Interval 1235           0            0          7

¹ Animal shows no sign of illness.
² Animal developed diarrhea, but did not die.
³ Animal developed diarrhea and died.



Pre-treatment of toxin A with IgYs against Interval 6 prevented diarrhea in 6 of 7
hamsters and completely prevented death in all 7. In contrast, as with preimmune IgY, IgYs
against Interval 4 and Interval 1235 had no effect on the onset of diarrhea and death in the
hamsters.

c) Quantification Of Specific Antibody Concentration In CTA
And Interval 6 IgY PEG Preparations

To determine the purity of IgY PEG preparations, an aliquot of a pMA1870-2680
(Interval 6) IgY PEG preparation was chromatographed using HPLC and a KW-803 sizing
column (Shodex). The resulting profile of absorbance at 280 nm is shown in Figure 16. The
single large peak corresponds to the predicted MW of IgY. Integration of the area under the
single large peak showed that greater than 95% of the protein eluted from the column was
present in this single peak. This result demonstrated that the majority (>95%) of the material
absorbing at 280 nm in the PEG preparation corresponds to IgY. Therefore, absorbance at
280 nm can be used to determine the total antibody concentration in PEG preparations.

To determine the concentration of Interval 6-specific antibodies (expressed as percent
of total antibody) within the CTA and pMA1870-2680 (Interval 6) PEG preparations, defined
quantities of these antibody preparations were affinity purified on a pPA1870-2680(H) (shown
schematically in Figure 15B) affinity column and the specific antibodies were quantified. In
Figure 15B the following abbreviations are used: pP refers to the pET23 vector (New England
BioLabs); pM refers to the pMAL™-c vector (New England BioLabs); pG refers to the pGEX
vector (Pharmacia); pB refers to the PinPoint™ Xa vector (Promega); A refers to toxin A; the
numbers refer to the amino acid interval expressed in the clone. The solid black ovals
represent the MBP; the hatched ovals represent glutathione S-transferase; the hatched circles
represent the biotin tag; and HHH represents the poly-histidine tag.

An affinity column containing recombinant toxin A repeat protein was made as
follows. Four ml of PBS-washed Actigel resin (Sterogene) was coupled with 5-10 mg of
pPA1870-2680 inclusion body protein [prepared as described in Example (17) and dialyzed
into PBS] in a 15 ml tube (Falcon) containing 1/10 final volume Ald-coupling solution (1 M
sodium cyanoborohydride). Aliquots of the supernatant from the coupling reactions, before
and after coupling, were assessed by Coomassie staining of 7.5% SDS-PAGE gels. Based
upon protein band intensities, greater than 6 mg of recombinant protein was coupled to the
resin. The resin was poured into a 10 ml column (BioRad), washed extensively with PBS,
pre-eluted with 4 M guanidine-HCl (in 10 mM Tris-HCl, pH 8.0; 0.005% thimerosal) and re-
equilibrated with PBS. The column was stored at 4°C.

Aliquots of a pMA1870-2680 (Interval 6) or a CTA IgY polyclonal antibody
preparation (PEG prep) were affinity purified on the above affinity column as follows. The
column was attached to a UV monitor (ISCO) and washed with PBS. For pMA1870-2680
IgY purification, a 2X PEG prep (filter sterilized using a 0.45 µ filter; approximately 500 mg
total IgY) was applied. The column was washed with PBS until the baseline was re-
established (the column flow-through was saved), washed with BBS-Tween to elute
nonspecifically binding antibodies and re-equilibrated with PBS. Bound antibody was eluted
from the column in 4 M guanidine-HCl (in 10 mM Tris-HCl, pH 8.0; 0.005% thimerosal).
The entire elution peak was collected in a 15 ml tube (Falcon). The column was re-
equilibrated and the column eluate was re-chromatographed as described above. The antibody
preparation was quantified by UV absorbance (the elution buffer was used to zero the
spectrophotometer). Total purified antibody was approximately 9 mg and 1 mg from the first
and second chromatography passes, respectively. The low yield from the second pass
indicated that most specific antibodies were removed by the first round of chromatography.
The estimated percentage of Interval 6 specific antibodies in the pMA1870-2680 PEG prep is
approximately 2%.

The percentage of Interval 6 specific antibodies in the CTA PEG prep was determined
(utilizing the same column and methodology described above) to be approximately 0.5% of
total IgY.



A 4X PEG prep contains approximately 20 mg/ml IgY. Thus in b) above,
approximately 400 µg specific antibody in the Interval 6 PEG prep neutralized 30 µg toxin A
in vivo.
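
The 400 µg figure follows from the total IgY concentration and the affinity-purified fraction. A minimal Python sketch of that calculation, using hypothetical function names; the example recovery of about 10 mg from 500 mg applied is an assumption chosen to be consistent with the stated 2% estimate.

```python
def percent_specific(purified_mg: float, total_igy_mg: float) -> float:
    """Affinity-purified antibody as a percentage of the total IgY applied."""
    return 100.0 * purified_mg / total_igy_mg


def specific_antibody_ug(igy_mg_per_ml: float,
                         dose_volume_ml: float,
                         specific_percent: float) -> float:
    """Micrograms of antigen-specific antibody contained in one dose."""
    total_ug = igy_mg_per_ml * dose_volume_ml * 1000.0
    return total_ug * specific_percent / 100.0


if __name__ == "__main__":
    # Assumed recovery: about 10 mg specific antibody from 500 mg total IgY.
    print(percent_specific(10.0, 500.0))
    # 1 ml of a 4X PEG prep at about 20 mg/ml IgY.
    print(specific_antibody_ug(20.0, 1.0, 2.0))   # Interval 6 prep, about 2% specific
    print(specific_antibody_ug(20.0, 1.0, 0.5))   # CTA prep, about 0.5% specific
```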

EXAMPLE 16 

In Vivo Treatment Of C. difficile Disease In
Hamsters By Recombinant Interval 6 Antibodies

The ability of antibodies directed against recombinant Interval 6 of toxin A to protect
hamsters in vivo from C. difficile disease was examined. This example involved: (a)
prophylactic treatment of C. difficile disease and (b) therapeutic treatment of C. difficile
disease.

a) Prophylactic Treatment Of C. difficile Disease 

This experiment was performed as described in Example 9(b). Three groups each
consisting of 7 female 100 gram Syrian hamsters (Charles River) were prophylactically treated
with either preimmune IgYs, IgYs against native toxin A and B [CTAB; see Example 8 (a)
and (b)] or IgYs against Interval 6. IgYs were prepared as 4X PEG preparations as described
in Example 9(a).

The animals were orally dosed 3 times daily, roughly at 4 hour intervals, for 12 days
with 1 ml antibody preparations diluted in Ensure®. Using estimates of specific antibody
concentration from Example 15(c), each dose of the Interval 6 antibody prep contained
approximately 400 µg of specific antibody. On day 2 each hamster was predisposed to C.
difficile infection by the oral administration of 3.0 mg of Clindamycin-HCl (Sigma) in 1 ml
of water. On day 3 the hamsters were orally challenged with 1 ml of C. difficile inoculum
strain ATCC 43596 in sterile saline containing approximately 100 organisms. The animals
were then observed for the onset of diarrhea and subsequent death during the treatment
period. The results are shown in Table 19.

TABLE 19

Lethality After 12 Days Of Treatment

Treatment Group      Number Animals Alive    Number Animals Dead

Preimmune                     0                       7
CTAB                          6                       1
Interval 6                    7                       0



Treatment of hamsters with orally-administered IgYs against Interval 6 successfully
protected 7 out of 7 (100%) of the animals from C. difficile disease. One of the hamsters in
this group presented with diarrhea which subsequently resolved during the course of
treatment. As shown previously in Example 9, antibodies to native toxin A and toxin B were
highly protective. In this Example, 6 out of 7 animals survived in the CTAB treatment group.
All of the hamsters treated with preimmune sera came down with diarrhea and died. The
survivors in both the CTAB and Interval 6 groups remained healthy throughout a 12 day post-
treatment period. In particular, 6 out of 7 Interval 6-treated hamsters survived at least 2
weeks after termination of treatment which suggests that these antibodies provide a long-
lasting cure. These results represent the first demonstration that antibodies generated against a
recombinant region of toxin A can prevent CDAD when administered passively to animals.
These results also indicate that antibodies raised against Interval 6 alone may be sufficient to
protect animals from C. difficile disease when administered prophylactically.

Previously others had raised antibodies against toxin A by actively immunizing
hamsters against a recombinant polypeptide located within the Interval 6 region [Lyerly,
D.M., et al. (1990) Curr. Microbiol. 21:29]. Figure 17 shows schematically the location of
the Lyerly, et al. intra-Interval 6 recombinant protein (cloned into the pUC vector) in
comparison with the complete Interval 6 construct (pMA1870-2680) used herein to generate
neutralizing antibodies directed against toxin A. In Figure 17, the solid black oval represents
the MBP which is fused to the toxin A Interval 6 in pMA1870-2680.

The Lyerly, et al. antibodies (intra-Interval 6) were only able to partially protect
hamsters against C. difficile infection in terms of survival (4 out of 8 animals survived) and
furthermore, these antibodies did not prevent diarrhea in any of the animals. Additionally,
animals treated with the intra-Interval 6 antibodies [Lyerly, et al. (1990), supra] died when
treatment was removed.

In contrast, the experiment shown above demonstrates that passive administration of
anti-Interval 6 antibodies prevented diarrhea in 6 out of 7 animals and completely prevented
death due to CDAD. Furthermore, as discussed above, passive administration of the anti-
Interval 6 antibodies provides a long lasting cure (i.e., treatment could be withdrawn without
incident).

b) Therapeutic Treatment Of C. difficile Disease: In Vivo
Treatment Of An Established C. difficile Infection In
Hamsters With Recombinant Interval 6 Antibodies

The ability of antibodies against recombinant Interval 6 of toxin A to therapeutically
treat C. difficile disease was examined. The experiment was performed essentially as
described in Example 10(b). Three groups, each containing seven to eight female Golden
Syrian hamsters (100 g each; Charles River) were treated with either preimmune IgY, IgYs
against native toxin A and toxin B (CTAB) and IgYs against Interval 6. The antibodies were
prepared as described above as 4X PEG preparations.

The hamsters were first predisposed to C. difficile infection with a 3 mg dose of
Clindamycin-HCl (Sigma) administered orally in 1 ml of water. Approximately 24 hrs later,
the animals were orally challenged with 1 ml of C. difficile strain ATCC 43596 in sterile
saline containing approximately 200 organisms. One day after infection, the presence of toxin
A and B was determined in the feces of the hamsters using a commercial immunoassay kit
(Cytoclone A+B EIA, Cambridge Biotech) to verify establishment of infection. Four
members of each group were randomly selected and tested. Feces from an uninfected hamster
was tested as a negative control. All infected animals tested positive for the presence of toxin
according to the manufacturer's procedure. The initiation of treatment then started
approximately 24 hr post-infection.

The animals were dosed daily at roughly 4 hr intervals with 1 ml antibody preparation
diluted in Ensure® (Ross Labs). The amount of specific antibodies given per dose
(determined by affinity purification) was estimated to be about 400 µg of anti-Interval 6 IgY
(for animals in the Interval 6 group) and 100 µg and 70 µg of anti-toxin A (Interval 6-
specific) and anti-toxin B (Interval 3-specific; see Example 19), respectively, for the CTAB
preparation. The animals were treated for 9 days and then observed for an additional 4 days
for the presence of diarrhea and death. The results indicating the number of survivors and the
number of dead 4 days post-infection are shown in Table 20.

TABLE 20

In vivo Therapeutic Treatment With Interval 6 Antibodies

Treatment Group      Number Animals Alive    Number Animals Dead

Preimmune                     4                       3
CTAB                          8                       0
Interval 6                    8                       0



Antibodies directed against both Interval 6 and CTAB successfully prevented death
from C. difficile when therapeutically administered 24 hr after infection. This result is
significant since many investigators begin therapeutic treatment of hamsters with existing
drugs (e.g., vancomycin, phenelfamycins, tiacumicins, etc.) 8 hr post-infection [Swanson, et
al. (1991) Antimicrobial Agents and Chemotherapy 35:1108 and (1989) J. Antibiotics 42:94].

Forty-two percent of hamsters treated with preimmune IgY died from CDAD. While
the anti-Interval 6 antibodies prevented death in the treated hamsters, they did not eliminate
all symptoms of CDAD as 3 animals presented with slight diarrhea. In addition, one CTAB-
treated and one preimmune-treated animal also had diarrhea 14 days post-infection. These
results indicate that anti-Interval 6 antibodies provide an effective means of therapy for
CDAD.
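
The mortality percentage quoted above can be recomputed from the alive/dead counts in Table 20. The Python sketch below is illustrative only, with a hypothetical function name; the counts are those tabulated for the three treatment groups.

```python
def mortality_percent(alive: int, dead: int) -> float:
    """Percentage of animals in a group that died."""
    total = alive + dead
    if total == 0:
        raise ValueError("group has no animals")
    return 100.0 * dead / total


# (alive, dead) counts per treatment group, as tabulated in Table 20.
table_20 = {"Preimmune": (4, 3), "CTAB": (8, 0), "Interval 6": (8, 0)}

if __name__ == "__main__":
    for group, (alive, dead) in table_20.items():
        print(f"{group}: {mortality_percent(alive, dead):.1f}% mortality")
```

Three deaths among seven preimmune-treated animals is just under 43%, which the text reports as forty-two percent.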



EXAMPLE 17 

Induction Of Toxin A Neutralizing Antibodies Requires Soluble Interval 6 Protein

As shown in Examples 11(d) and 15, expression of recombinant proteins in E. coli
may result in the production of either soluble or insoluble protein. If insoluble protein is
produced, the recombinant protein is solubilized prior to immunization of animals. To
determine whether one or both of the soluble or insoluble recombinant proteins could be used
to generate neutralizing antibodies to toxin A, the following experiment was performed. This
example involved a) expression of the toxin A repeats and subfragments of these repeats in E.
coli using a variety of expression vectors; b) identification of recombinant toxin A repeats and
sub-regions to which neutralizing antibodies bind; and c) determination of the neutralization
ability of antibodies raised against soluble and insoluble toxin A repeat immunogen.

a) Expression Of The Toxin A Repeats And Subfragments Of
These Repeats In E. coli Using A Variety Of Expression
Vectors

The Interval 6 immunogen utilized in Examples 15 and 16 was the pMA1870-2680
protein, in which the toxin A repeats are expressed as a soluble fusion protein with the MBP
(described in Example 11). Interestingly, expression of this region (from the SpeI site to the
end of the repeats, see Figure 15B) in three other expression constructs, as either native
(pPA1870-2680), poly-His tagged [pPA1870-2680 (H)] or biotin-tagged (pBA1870-2680)
proteins resulted in completely insoluble protein upon induction of the bacterial host (see
Figure ). The host strain BL21 (Novagen) was used for expression of pBA1870-2680 and
host strain BL21(DE3) (Novagen) was used for expression of pPA1870-2680 and pPA1870-
2680(H). These insoluble proteins accumulated to high levels in inclusion bodies. Expression
of recombinant plasmids in E. coli host cells grown in 2X YT medium was performed as
described [Williams, et al. (1995), supra].

As summarized in Figure 15B, expression of fragments of the toxin A repeats (as
either N-terminal SpeI-EcoRI fragments, or C-terminal EcoRI-end fragments) also yielded
high levels of insoluble protein using pGEX (pGA1870-2190), PinPoint™-Xa (pBA1870-2190
and pBA2250-2680) and pET expression systems (pPA1870-2190). The pGEX and pET
expression systems are described in Example 11. The PinPoint™-Xa expression system drives
the expression of fusion proteins in E. coli. Fusion proteins from PinPoint™-Xa vectors
contain a biotin tag at the amino-terminal end and can be affinity purified on SoftLink™ Soft
Release avidin resin (Promega) under mild denaturing conditions (3 mM biotin).

The solubility of expressed proteins from the pPG1870-2190 and pPA1870-2190
expression constructs was determined after induction of recombinant protein expression under
conditions reported to enhance protein solubility [these conditions comprise growth of the
host at reduced temperature (30°C) and the utilization of high (1 mM IPTG) or low (0.1 mM
IPTG) concentrations of inducer [Williams et al. (1995), supra]]. All expressed recombinant
toxin A protein was insoluble under these conditions. Thus, expression of these fragments of
the toxin A repeats in pET and pGEX expression vectors results in the production of insoluble
recombinant protein even when the host cells are grown at reduced temperature and using
lower concentrations of the inducer. Although expression of these fragments in pMal vectors
yielded affinity purifiable soluble fusion protein, the protein was either predominantly
insoluble (pMA1870-2190) or unstable (pMA2250-2650). Attempts to solubilize expressed
protein from the pMA1870-2190 expression construct using reduced temperature or lower
inducer concentration (as described above) did not improve fusion protein solubility.

Collectively, these results demonstrate that expression of the toxin A repeat region in
E. coli results in the production of insoluble recombinant protein, when expressed as either
large (aa 1870-2680) or small (aa 1870-2190 or aa 2250-2680) fragments, in a variety of
expression vectors (native or poly-His tagged pET, pGEX or PinPoint™-Xa vectors), utilizing
growth conditions shown to enhance protein solubility. The exceptions to this rule were
fusions with the MBP, which enhanced protein solubility, either partially (pMA1870-2190) or
fully (pMA1870-2680).

b) Identification Of Recombinant Toxin A Repeats And
Sub-Regions To Which Neutralizing Antibodies Bind

Toxin A repeat regions to which neutralizing antibodies bind were identified by
utilizing recombinant toxin A repeat region proteins expressed as soluble or insoluble proteins
to deplete protective antibodies from a polyclonal pool of antibodies against native C. difficile
toxin A. An in vivo assay was developed to evaluate proteins for the ability to bind
neutralizing antibodies.

The rationale for this assay is as follows. Recombinant proteins were first pre-mixed
with antibodies against native toxin A (CTA antibody; generated in Example 8) and allowed
to react. Subsequently, C. difficile toxin A was added at a concentration lethal to hamsters
and the mixture was administered to hamsters via IP injection. If the recombinant protein
contains neutralizing epitopes, the CTA antibodies would lose their ability to bind toxin A,
resulting in diarrhea and/or death of the hamsters.

The assay was performed as follows. The lethal dose of toxin A when delivered orally
to nine 40 to 50 g Golden Syrian hamsters (Sasco) was determined to be 10 to 30 µg. The
PEG-purified CTA antibody preparation was diluted to 0.5X concentration (i.e., the antibodies
were diluted at twice the original yolk volume) in 0.1 M carbonate buffer, pH 9.5. The
antibodies were diluted in carbonate buffer to protect them from acid degradation in the
stomach. The concentration of 0.5X was used because it was found to be the lowest effective
concentration against toxin A. The concentration of Interval 6-specific antibodies in the 0.5X
CTA prep was estimated to be 10-15 µg/ml (estimated using the method described in
Example 15).

The inclusion body preparation [insoluble Interval 6 protein; pPA1870-2680(H)] and
the soluble Interval 6 protein [pMA1870-2680; see Figure 15] were both compared for their
ability to bind to neutralizing antibodies against C. difficile toxin A (CTA). Specifically, 1 to
2 mg of recombinant protein was mixed with 5 ml of a 0.5X CTA antibody prep (estimated
to contain 60-70 µg of Interval 6-specific antibody). After incubation for 1 hr at 37°C, CTA
(Tech Lab) at a final concentration of 30 µg/ml was added and incubated for another 1 hr at
37°C. One ml of this mixture containing 30 µg of toxin A (and 10-15 µg of Interval 6-
specific antibody) was administered orally to 40-50 g Golden Syrian hamsters (Sasco).
Recombinant proteins that result in the loss of neutralizing capacity of the CTA antibody
would indicate that those proteins contain neutralizing epitopes. Preimmune and CTA
antibodies (both at 0.5X) without the addition of any recombinant protein served as negative
and positive controls, respectively.
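
The quantities in this assay follow from concentration-times-volume arithmetic. The following is a minimal Python sketch, assuming that the toxin addition does not appreciably change the 5 ml mixture volume; the function and variable names are illustrative and do not come from the protocol.

```python
def amount_ug(conc_ug_per_ml: float, volume_ml: float) -> float:
    """Mass (ug) of a component present at conc_ug_per_ml in volume_ml."""
    return conc_ug_per_ml * volume_ml


# Values stated in the text.
PREP_VOLUME_ML = 5.0               # 0.5X CTA antibody prep per mixture
AB_CONC_UG_PER_ML = (10.0, 15.0)   # estimated Interval 6-specific antibody range
TOXIN_CONC_UG_PER_ML = 30.0        # final toxin A concentration
DOSE_VOLUME_ML = 1.0               # volume administered to each hamster

# Assumption: the mixture stays at roughly 5 ml after toxin is added.
antibody_in_prep = tuple(amount_ug(c, PREP_VOLUME_ML) for c in AB_CONC_UG_PER_ML)
antibody_per_dose = tuple(amount_ug(c, DOSE_VOLUME_ML) for c in AB_CONC_UG_PER_ML)
toxin_per_dose = amount_ug(TOXIN_CONC_UG_PER_ML, DOSE_VOLUME_ML)
doses_per_mixture = int(PREP_VOLUME_ML // DOSE_VOLUME_ML)

print(f"antibody in prep: {antibody_in_prep[0]:.0f}-{antibody_in_prep[1]:.0f} ug")
print(f"antibody per dose: {antibody_per_dose[0]:.0f}-{antibody_per_dose[1]:.0f} ug")
print(f"toxin A per dose: {toxin_per_dose:.0f} ug")
print(f"doses per mixture: {doses_per_mixture}")
```

Under these assumptions the 5 ml prep holds 50-75 µg of Interval 6-specific antibody, a range that brackets the 60-70 µg estimate quoted above, and each 1 ml dose carries 30 µg of toxin A.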

Two other inclusion body preparations, both expressed as insoluble products in the
pET vector, were tested: one containing a different insert (toxin B fragment) other than
Interval 6, called pPB1850-2070 (see Figure 18), which serves as a control for insoluble
Interval 6; the other was a truncated version of the Interval 6 region called pPA1870-2190
(see Figure 15B). The results of this experiment are shown in Table 21.

TABLE 21

Binding Of Neutralizing Antibodies By Soluble Interval 6 Protein

                                  Study Outcome After 24 Hours
Treatment Group(a)                Healthy(1)    Diarrhea(2)    Dead(3)

Preimmune Ab                      0
CTA Ab                            4             1              0
CTA Ab + Int 6 (soluble)          1
CTA Ab + Int 6 (insoluble)        5             0              0
CTA Ab + pPB1850-2070             5             0              0
CTA Ab + pPA1870-2190             5             0              0

(a) C. difficile toxin A (CTA) was added to each group.
(1) Animals showed no signs of illness.
(2) Animals developed diarrhea but did not die.
(3) Animals developed diarrhea and died.

Preimmune antibody was ineffective against toxin A, while anti-CTA antibodies at a
dilute 0.5X concentration almost completely protected the hamsters against the enterotoxic
effects of CTA. The addition of recombinant proteins pPB1850-2070 or pPA1870-2190 to
the anti-CTA antibody had no effect upon its protective ability, indicating that these
recombinant proteins do not bind to neutralizing antibodies. On the other hand, recombinant
Interval 6 protein was able to bind to neutralizing anti-CTA antibodies and neutralized the in
vivo protective effect of the anti-CTA antibodies. Four out of five animals in the group
treated with anti-CTA antibodies mixed with soluble Interval 6 protein exhibited toxin
associated toxicity (diarrhea and death). Moreover, the results showed that Interval 6 protein
must be expressed as a soluble product in order for it to bind to neutralizing anti-CTA
antibodies since the addition of insoluble Interval 6 protein had no effect on the neutralizing
capacity of the CTA antibody prep.



c) Determination Of Neutralization Ability Of Antibodies
Raised Against Soluble And Insoluble Toxin A Repeat
Immunogen

To determine if neutralizing antibodies are induced against solubilized inclusion
bodies, insoluble toxin A repeat protein was solubilized and specific antibodies were raised in
chickens. Insoluble pPA1870-2680 protein was solubilized using the method described in
Williams et al. (1995), supra. Briefly, induced cultures (500 ml) were pelleted by
centrifugation at 3,000 X g for 10 min at 4°C. The cell pellets were resuspended thoroughly
in 10 ml of inclusion body sonication buffer (25 mM HEPES pH 7.7, 100 mM KCl, 12.5 mM
MgCl2, 20% glycerol, 0.1% (v/v) Nonidet P-40, 1 mM DTT). The suspension was transferred
to a 30 ml non-glass centrifuge tube. Five hundred µl of 10 mg/ml lysozyme was added and
the tubes were incubated on ice for 30 min. The suspension was then frozen at -70°C for at
least 1 hr. The suspension was thawed rapidly in a water bath at room temperature and then
placed on ice. The suspension was then sonicated using at least eight 15 sec bursts of the
microprobe (Branson Sonicator Model No. 450) with intermittent cooling on ice.

The sonicated suspension was transferred to a 35 ml Oakridge tube and centrifuged at
6,000 X g for 10 min at 4°C to pellet the inclusion bodies. The pellet was washed 2 times by
pipetting or vortexing in fresh, ice-cold RIPA buffer [0.1% SDS, 1% Triton X-100, 1%
sodium deoxycholate in TBS (25 mM Tris-Cl pH 7.5, 150 mM NaCl)]. The inclusion bodies
were recentrifuged after each wash. The inclusion bodies were dried and transferred using a
small metal spatula to a 15 ml tube (Falcon). One ml of 10% SDS was added and the pellet
was solubilized by gently pipetting the solution up and down using a 1 ml micropipettor. The
solubilization was facilitated by heating the sample to 95°C when necessary.

Once the inclusion bodies were in solution, the samples were diluted with 9 volumes
of PBS. The protein solutions were dialyzed overnight against a 100-fold volume of PBS
containing 0.05% SDS at room temperature. The dialysis buffer was then changed to PBS
containing 0.01% SDS and the samples were dialyzed for several hours to overnight at room
temperature. The samples were stored at 4°C until used. Prior to further use, the samples
were warmed to room temperature to allow any precipitated SDS to go back into solution.
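
The SDS concentrations reached in this dilution and dialysis scheme can be estimated with a simple mass balance. The sketch below rests on several assumptions that are not stated in the protocol: ideal equilibration across the dialysis membrane (SDS micelles equilibrate slowly in practice, so real values may be higher), a 10 ml sample after diluting 1 ml with 9 volumes of PBS, and a 100-fold buffer volume for the second dialysis step as well as the first. All names are illustrative.

```python
def dilute(conc: float, sample_vol: float, diluent_vol: float) -> float:
    """Concentration after adding diluent that contains none of the solute."""
    return conc * sample_vol / (sample_vol + diluent_vol)


def dialyze(sample_conc: float, sample_vol: float,
            buffer_conc: float, buffer_vol: float) -> float:
    """Concentration once sample and buffer have fully equilibrated."""
    total_solute = sample_conc * sample_vol + buffer_conc * buffer_vol
    return total_solute / (sample_vol + buffer_vol)


sample_ml = 1.0                                  # 1 ml of 10% SDS
sds = dilute(10.0, sample_ml, 9 * sample_ml)     # diluted with 9 volumes of PBS
sample_ml *= 10                                  # sample is now 10 ml

# First dialysis: 100-fold volume of PBS containing 0.05% SDS.
step1 = dialyze(sds, sample_ml, 0.05, 100 * sample_ml)
# Second dialysis: PBS containing 0.01% SDS (100-fold volume is assumed).
step2 = dialyze(step1, sample_ml, 0.01, 100 * sample_ml)

print(f"after dilution:        {sds:.2f}% SDS")
print(f"after first dialysis:  {step1:.4f}% SDS")
print(f"after second dialysis: {step2:.4f}% SDS")
```

Under these idealized assumptions each dialysis step brings the sample close to the SDS concentration of the surrounding buffer, which is the purpose of the stepwise reduction.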
The inclusion body preparation was used to immunize hens. The protein was dialyzed
into PBS and emulsified with approximately equal volumes of CFA for the initial
immunization or IFA for subsequent booster immunizations. On day zero, for each of the
recombinant preparations, two egg laying white Leghorn hens were each injected
at multiple sites (IM and SC) with 1 ml of recombinant protein-adjuvant mixture containing
approximately 0.5-1.5 mg of recombinant protein. Booster immunizations of 1.0 mg were
given on day 14 and day 28. Eggs were collected on day 32 and the antibody isolated using
PEG as described in Example 14(a). High titers of toxin A specific antibodies were present
(as assayed by ELISA, using the method described in Example 13). Titers were determined
for both antibodies against recombinant polypeptides pPA1870-2680 and pMA1870-2680 and
were found to be comparable at > 1:62,500.

Antibodies against soluble Interval 6 (pMA1870-2680) and insoluble Interval 6
[(inclusion body), pPA1870-2680] were tested for neutralizing ability against toxin A using
the in vivo assay described in Example 15(b). Preimmune antibodies and antibodies against
toxin A (CTA) served as negative and positive controls, respectively. The results are shown
in Table 22.



TABLE 22

Neutralization Of Toxin A By Antibodies Against Soluble Interval 6 Protein

                                  Study Outcome After 24 Hours
Antibody Treatment Group          Healthy(1)    Diarrhea(2)    Dead(3)

Preimmune                         1             0              4
CTA                               5             0              0
Interval 6 (Soluble)(4)           5             0              0
Interval 6 (Insoluble)            0

(1) Animals showed no sign of illness.
(2) Animal developed diarrhea but did not die.
(3) Animal developed diarrhea and died.
(4) 400 µg/ml.

Antibodies raised against native toxin A were protective while preimmune antibodies
had little effect. As found using the in vitro CHO assay [described in Example 8(d)], where
antibodies raised against the soluble Interval 6 could partially neutralize the effects of toxin A,
here they were able to completely neutralize toxin A in vivo. In contrast, the antibodies
raised against the insoluble Interval 6 were unable to neutralize the effects of toxin A in vivo
as shown above (Table 22) and in vitro as shown in the CHO assay [described in Example
8(d)].

These results demonstrate that soluble toxin A repeat immunogen is necessary to
induce the production of neutralizing antibodies in chickens, and that the generation of such
soluble immunogen is obtained only with a specific expression vector (pMal) containing the
toxin A region spanning aa 1870-2680. That is to say, insoluble protein that is subsequently
solubilized does not result in a toxin A antigen that will elicit a neutralizing antibody.

EXAMPLE 18 

Cloning And Expression Of The C. difficile Toxin B Gene

The toxin B gene has been cloned and sequenced; the amino acid sequence deduced
from the cloned nucleotide sequence predicts a MW of 269.7 kD for toxin B [Barroso et al.,
Nucl. Acids Res. 18:4004 (1990)]. The nucleotide sequence of the coding region of the entire
toxin B gene is listed in SEQ ID NO:9. The amino acid sequence of the entire toxin B
protein is listed in SEQ ID NO:10. The amino acid sequence consisting of amino acid
residues 1850 through 2360 of toxin B is listed in SEQ ID NO:11. The amino acid sequence
consisting of amino acid residues 1750 through 2360 of toxin B is listed in SEQ ID NO:12.

Given the expense and difficulty of isolating native toxin B protein, it would be
advantageous to use simple and inexpensive procaryotic expression systems to produce and
purify high levels of recombinant toxin B protein for immunization purposes. Ideally, the
isolated recombinant protein would be soluble in order to preserve native antigenicity, since
solubilized inclusion body proteins often do not fold into native conformations. Indeed, as
shown in Example 17, neutralizing antibodies against recombinant toxin A were only obtained
when soluble recombinant toxin A polypeptides were used as the immunogen. To allow ease
of purification, the recombinant protein should be expressed to levels greater than 1 mg/liter
of E. coli culture.

To determine whether high levels of recombinant toxin B protein could be produced in
E. coli, fragments of the toxin B gene were cloned into various prokaryotic expression
vectors, and assessed for the ability to express recombinant toxin B protein in E. coli. This
Example involved (a) cloning of the toxin B gene and (b) expression of the toxin B gene in
E. coli.

a) Cloning Of The Toxin B Gene

The toxin B gene was cloned using PCR amplification from C. difficile genomic DNA.
Initially, the gene was cloned in two overlapping fragments, using primer pairs P5/P6 and
P7/P8. The location of these primers along the toxin B gene is shown schematically in Figure
18. The sequence of each of these primers is: P5: 5' TAGAAAAAATGGCAAATGT 3' (SEQ
ID NO:11); P6: 5' TTTCATCTTGTAGAGTCAAAG 3' (SEQ ID NO:12);
P7: 5' GAKiCCACAAGATGATTTAGTG 3' (SEQ ID NO:13); and
P8: 5' CTAATTGAGCTGTATCAGGATC 3' (SEQ ID NO:14).

Figure 18 also shows the location of the following primers along the toxin B gene: P9,
which consists of the sequence 5' CGGAATTCCTAGAAAAAATGGCAAATG 3' (SEQ ID
NO:15); P10, which consists of the sequence 5' GCTCTAGAATGACCATAAGCTAGCCA
3' (SEQ ID NO:16); P11, which consists of the sequence
5' C (KiAA r rCCiAGTT(K]TAGAAA(;GTGGA 3' (SEQ ID NO:17); P13, which consists of
the sequence 5' CGGAATTCGGTTATTATCTTAAGGATG 3' (SEQ ID NO:18); and P14,
which consists of the sequence 5' CGGAATTCTTGATAACTGGATTTGTGAC 3' (SEQ ID
NO:19). The amino acid sequence consisting of amino acid residues 1852 through 2362 of
toxin B is listed in SEQ ID NO:20. The amino acid sequence consisting of amino acid
residues 1 7::^5 through 2362 of toxin B is listed in SEQ ID NO:21.

Clostridium difficile VPI strain 10463 was obtained from the American Type Culture
Collection (ATCC 43255) and grown under anaerobic conditions in brain-heart infusion
medium (Becton Dickinson). High molecular-weight C. difficile DNA was isolated essentially
as described [Wren and Tabaqchali (1987) J. Clin. Microbiol., 25:2402], except 1) 100 µg/ml
proteinase K in 0.5% SDS was used to disrupt the bacteria and 2) cetyltrimethylammonium
bromide (CTAB) precipitation [as described by Ausubel et al., Eds., Current Protocols in
Molecular Biology, Vol. 2 (1989) Current Protocols] was used to remove carbohydrates from
the cleared lysate. Briefly, after disruption of the bacteria with proteinase K and SDS, the
solution is adjusted to approximately 0.7 M NaCl by the addition of a 1/7 volume of 5 M
NaCl. A 1/10 volume of CTAB/NaCl (10% CTAB in 0.7 M NaCl) solution was added and
the solution was mixed thoroughly and incubated 10 min at 65°C. An equal volume of
chloroform/isoamyl alcohol (24:1) was added and the phases were thoroughly mixed. The
organic and aqueous phases were separated by centrifugation in a microfuge for 5 min. The
aqueous supernatant was removed and extracted with phenol/chloroform/isoamyl alcohol
(25:24:1). The phases were separated by centrifugation in a microfuge for 5 min. The
supernatant was transferred to a fresh tube and the DNA was precipitated with isopropanol.
The DNA precipitate was pelleted by brief centrifugation in a microfuge. The DNA pellet
was washed with 70% ethanol to remove residual CTAB. The DNA pellet was then dried
and redissolved in TE buffer (10 mM Tris-HCl pH 8.0, 1 mM EDTA). The integrity and
yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda
DNA after electrophoresis on an agarose gel.
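
The salt and CTAB concentrations in the extraction steps follow from the stated volume ratios. The sketch below is a rough check only; it assumes that the lysate contributes negligible NaCl before adjustment and that volumes are additive, and the helper name is illustrative.

```python
def mix(conc_a: float, vol_a: float, conc_b: float, vol_b: float) -> float:
    """Concentration after combining two solutions with additive volumes."""
    return (conc_a * vol_a + conc_b * vol_b) / (vol_a + vol_b)


lysate_vol = 1.0                      # arbitrary units; only ratios matter

# Step 1: add a 1/7 volume of 5 M NaCl (lysate NaCl assumed to be ~0 M).
nacl_stock_vol = lysate_vol / 7
nacl_after_salt = mix(0.0, lysate_vol, 5.0, nacl_stock_vol)
vol_after_salt = lysate_vol + nacl_stock_vol

# Step 2: add a 1/10 volume of CTAB/NaCl (10% CTAB in 0.7 M NaCl).
ctab_vol = vol_after_salt / 10
nacl_final = mix(nacl_after_salt, vol_after_salt, 0.7, ctab_vol)
ctab_final = mix(0.0, vol_after_salt, 10.0, ctab_vol)

print(f"NaCl after salt adjustment: {nacl_after_salt:.3f} M")
print(f"NaCl after CTAB addition:   {nacl_final:.3f} M")
print(f"CTAB after addition:        {ctab_final:.2f} %")
```

With these assumptions the salt adjustment gives about 0.63 M NaCl, in line with the "approximately 0.7 M" stated in the protocol, and the CTAB ends up at a little under 1%.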

Toxin B fragments were cloned by PCR utilizing a proofreading thermostable DNA
polymerase [native Pfu polymerase (Stratagene)]. The high fidelity of this polymerase
reduces the mutation problems associated with amplification by error prone polymerases (e.g.,
Taq polymerase). PCR amplification was performed using the PCR primer pairs P5 (SEQ ID
NO:11) with P6 (SEQ ID NO:12) and P7 (SEQ ID NO:13) with P8 (SEQ ID NO:14) in 50
µl reactions containing 10 mM Tris-HCl pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 200 µM of
each dNTP, 0.2 µM each primer, and 50 ng C. difficile genomic DNA. Reactions were
overlaid with 100 µl mineral oil, heated to 94°C for 4 min, 0.5 µl native Pfu polymerase
(Stratagene) was added, and the reactions were cycled 30 times at 94°C for 1 min, 50°C for 1
min, 72°C (2 min for each kb of sequence to be amplified), followed by 10 min at 72°C.
Duplicate reactions were pooled, chloroform extracted, and ethanol precipitated. After
washing in 70% ethanol, the pellets were resuspended in 50 µl TE buffer (10 mM Tris-HCl
pH 8.0, 1 mM EDTA).
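
Because the extension time scales with product length (2 min per kb), the cycling program differs for each amplicon. The following sketch builds the program described above for a given product size; the class and function names are illustrative, ramp times between temperatures are ignored, and the 3.6 kb value used in the example is the P7/P8 product size given later in this Example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float
    minutes: float


def pfu_program(amplicon_kb: float, cycles: int = 30) -> list:
    """Cycling steps for the protocol: extension is 2 min per kb of product."""
    extension_min = 2.0 * amplicon_kb
    program = [Step("initial denaturation (polymerase added afterwards)", 94, 4.0)]
    for _ in range(cycles):
        program.append(Step("denature", 94, 1.0))
        program.append(Step("anneal", 50, 1.0))
        program.append(Step("extend", 72, extension_min))
    program.append(Step("final extension", 72, 10.0))
    return program


def total_minutes(program) -> float:
    """Total hold time, ignoring ramping between temperatures."""
    return sum(step.minutes for step in program)


program = pfu_program(3.6)
print(f"steps: {len(program)}")
print(f"extension per cycle: {program[3].minutes:.1f} min")
print(f"total hold time: {total_minutes(program):.0f} min")
```

For a 3.6 kb product the extension step dominates the run, so the total hold time under this scheme approaches five hours.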

The P5/P6 amplification product was cloned into pUC19 as outlined below. 10 µl
aliquots of DNA were gel purified using the Prep-A-Gene kit (BioRad), and ligated to SmaI
restricted pUC19 vector. Recombinant clones were isolated and confirmed by restriction
digestion using standard recombinant molecular biology techniques (Sambrook et al., 1989).
Inserts from two independent isolates were identified in which the toxin B insert was oriented
such that the vector BamHI and SacI sites were 5' and 3' oriented, respectively (pUCB10-
1530). The insert-containing BamHI/SacI fragment was cloned into similarly cut pET23a-c
vector DNA, and protein expression was induced in small scale cultures (5 ml) of identified
clones.

Total protein extracts were isolated, resolved on SDS-PAGE gels, and toxin B protein
identified by Western analysis utilizing a goat anti-toxin affinity purified antibody (Tech
Lab). Procedures for protein induction, SDS-PAGE, and Western blot analysis were
performed as described in Williams et al. (1995), supra. In brief, 5 ml cultures of bacteria
grown in 2XYT containing 100 µg/ml ampicillin containing the appropriate recombinant clone
were induced to express recombinant protein by addition of IPTG to 1 mM. The E. coli hosts
used were: BL21(DE3) or BL21(DE3)LysS (Novagen) for pET plasmids.

Cultures were induced by the addition of IPTG to a final concentration of 1.0 mM
when the cell density reached 0.5 OD600, and induced protein was allowed to accumulate for
two hrs after induction. Protein samples were prepared by pelleting 1 ml aliquots of bacteria
by centrifugation (1 min in microfuge), and resuspension of the pelleted bacteria in 150 µl of
2X SDS-PAGE sample buffer (0.125 mM Tris-HCl pH 6.8, 2 mM EDTA, 6% SDS, 20%
glycerol, 0.025% bromophenol blue; β-mercaptoethanol is added to 5% before use). The
samples were heated to 95°C for 5 min, then cooled and 5 or 10 µl loaded on 7.5% SDS-
PAGE gels. High molecular weight protein markers (BioRad) were also loaded, to allow
estimation of the MW of identified fusion proteins. After electrophoresis, protein was
detected either generally by staining the gels with Coomassie Blue, or specifically, by blotting
to nitrocellulose for Western blot detection of specific immunoreactive protein. The MW of
induced toxin B reactive protein allowed the integrity of the toxin B reading frame to be
determined.

The pET23b recombinant (pPB10-1530) expressed high MW recombinant toxin B
reactive protein, consistent with predicted values. This confirmed that frame terminating
errors had not occurred during PCR amplification. A pET23b expression clone containing the
10-1750 aa interval of the toxin B gene was constructed, by fusion of the EcoRV-SpeI
fragment of the P7/P8 amplification product to the P5-EcoRV interval of the P5/P6
amplification product (see Figure 18) in pPB10-1530. The integrity of this clone (pPB10-
1750) was confirmed by restriction mapping, and Western blot detection of expressed
recombinant toxin B protein. Levels of induced protein from both pPB10-1530 and pPB10-
1750 were too low to facilitate purification of usable amounts of recombinant toxin B protein.
The remaining 1750-2360 aa interval was directly cloned into pMAL or pET expression
vectors from the P7/P8 amplification product as described below.

b) Expression Of The Toxin B Gene

i) Overview Of Expression Methodologies

Fragments of the toxin B gene were expressed as either native or fusion proteins in E.
coli. Native proteins were expressed in either the pET23a-c or pET16b expression vectors
(Novagen). The pET23 vectors contain an extensive polylinker sequence in all three reading
frames (a-c vectors) followed by a C-terminal poly-histidine repeat. The pET16b vector
contains a N-terminal poly-histidine sequence immediately 5' to a small polylinker. The
poly-histidine sequence binds to Ni-Chelate columns and allows affinity purification of tagged
target proteins [Williams et al. (1995), supra]. These affinity tags are small (10 aa for
pET16b, 6 aa for pET23) allowing the expression and affinity purification of native proteins
with only limited amounts of foreign sequences.

An N-terminal histidine-tagged derivative of pET16b containing an extensive cloning
cassette was constructed to facilitate cloning of N-terminal poly-histidine tagged toxin B
expressing constructs. This was accomplished by replacement of the promoter region of the
pET23a and b vectors with that of the pET16b expression vector. Each vector was restricted
with BglII and NdeI, and the reactions resolved on a 1.2% agarose gel. The pET16b
promoter region (contained in a 200 bp BglII-NdeI fragment) and the promoter-less pET23 a
or b vectors were cut from the gel, mixed and Prep-A-Gene (BioRad) purified. The eluted
DNA was ligated, and transformants screened for promoter replacement by NcoI digestion of
purified plasmid DNA (the pET16b promoter contains this site, the pET23 promoter does
not). These clones (denoted pETHisa or b) contain the pET16b promoter (consisting of a
pT7-lac promoter, ribosome binding site and poly-histidine (10 aa) sequence) fused at the NdeI
site to the extensive pET23 polylinker.

All MBP fusion proteins were constructed and expressed in the pMAL™-c or
pMAL™-p2 vectors (New England Biolabs) in which the protein of interest is expressed as a
C-terminal fusion with MBP. All pET plasmids were expressed in either the BL21(DE3) or
BL21(DE3)LysS expression hosts, while pMal plasmids were expressed in the BL21 host.

Large scale (500 mls to 1 liter) cultures of each recombinant were grown in 2X YT
broth, induced, and soluble protein fractions were isolated as described [Williams, et al.
(1995), supra]. The soluble protein extracts were affinity chromatographed to isolate
recombinant fusion protein, as described [Williams et al., (1995) supra]. In brief, extracts
containing tagged pET fusions were chromatographed on a nickel chelate column, and eluted
using imidazole salts or low pH (pH 4.0) as described by the distributor (Novagen or Qiagen).
Extracts containing soluble pMAL fusion protein were prepared and chromatographed in PBS
buffer over an amylose resin (New England Biolabs) column, and eluted with PBS containing
10 mM maltose as described [Williams et al. (1995), supra].

ii) Overview Of Toxin B Expression

In both large expression constructs described in (a) above, only low level (i.e., less
than 1 mg/liter of intact or nondegraded recombinant protein) expression of recombinant
protein was detected. A number of expression constructs containing smaller fragments of the
toxin B gene were then constructed, to determine if small regions of the gene can be
expressed to high levels (i.e., greater than 1 mg/liter intact protein) without extensive protein
degradation. All were constructed by in frame fusions of convenient toxin B restriction
fragments to either the pMAL-c, pET23a-c, pET16b or pETHisa-b expression vectors, or by
engineering restriction sites at specific locations using PCR amplification [using the same
conditions described in (a) above]. In all cases, clones were verified by restriction mapping,
and, where indicated, DNA sequencing.

Protein preparations from induced cultures of each of these constructs were analyzed,
by SDS-PAGE, to estimate protein stability (Coomassie Blue staining) and immunoreactivity
against anti-toxin B specific antiserum (Western analysis). Higher levels of intact (i.e.,
nondegraded), full length fusion proteins were observed with the smaller constructs as
compared with the larger recombinants, and a series of expression constructs spanning the
entire toxin B gene were constructed (Figures 18, 19 and 20 and Table 23).

Constructs that expressed significant levels of recombinant toxin B protein (greater
than 1 mg/liter intact recombinant protein) in E. coli were identified and a series of these
clones that spans the toxin B gene are shown in Figure 19 and summarized in Table 23.
These clones were utilized to isolate pure toxin B recombinant protein from the entire toxin B
gene. Significant protein yields were obtained from pMAL expression constructs spanning the
entire toxin B gene, and yields of full length soluble fusion protein ranged from an estimated
1 mg/liter culture (pMB1100-1530) to greater than 20 mg/liter culture (pMB1750-2360).

Representative purifications of MBP and poly-histidine-tagged toxin B recombinants
are shown in Figures 21 and 22. Figure 21 shows a Coomassie Blue stained 7.5% SDS-
PAGE gel on which various protein samples extracted from bacteria harboring pMB1850-
2360 were electrophoresed. Samples were loaded as follows: Lane 1: protein extracted from
uninduced culture; Lane 2: induced culture protein; Lane 3: total protein from induced culture
after sonication; Lane 4: soluble protein; and Lane 5: eluted affinity purified protein. Figure
22 depicts the purification of recombinant proteins expressed in bacteria harboring either
pPB1850-2360 (Lanes 1-3) or pPB1750-2360 (Lanes 4-6). Samples were loaded as follows:
uninduced total protein (Lanes 1 and 4); induced total protein (Lanes 2 and 5); and eluted
affinity purified protein (Lanes 3 and 6). The broad range molecular weight protein markers
(BioRad) are shown in Lane 7.

Thus, although high level expression was not attained using large expression constructs 
from the toxin B gene, usable levels of recombinant protein were obtained by isolating 
induced protein from a series of smaller pMAL expression constructs that span the entire 
toxin B gene. 

These results represent the first demonstration of the feasibility of expressing
recombinant toxin B protein to high levels in E. coli. As well, expression of small regions of
the putative ligand binding domain (repeat region) of toxin B as native protein yielded
insoluble protein, while large constructs, or fusions to MBP were soluble (Figure 19),
demonstrating that specific methodologies are necessary to produce soluble fusion protein
from this interval.

iii) Clone Construction And Expression Details

A portion of the toxin B gene containing the toxin B repeat region [amino acid
residues 1852-2362 of toxin B (SEQ ID NO:20)] was isolated by PCR amplification of this
interval of the toxin B gene from C. difficile genomic DNA. The sequence, and location
within the toxin B gene, of the two PCR primers [P7 (SEQ ID NO:13) and P8 (SEQ ID
NO:14)] used to amplify this region are shown in Figure 18.

DNA from the PCR amplification was purified by chloroform extraction and ethanol
precipitation as described above. The DNA was restricted with SpeI, and the cleaved DNA
was resolved by agarose gel electrophoresis. The restriction digestion with SpeI cleaved the
3.6 kb amplification product into a 1.8 kb doublet band. This doublet band was cut from the
gel and mixed with appropriately cut, gel purified pMALc or pET23b vector. These vectors
were prepared by digestion with HindIII, filling in the overhanging ends using the Klenow
enzyme, and cleaving with XbaI (pMALc) or NheI (pET23b). The gel purified DNA
fragments were purified using the Prep-A-Gene kit (BioRad) and the DNA was ligated,
transformed and putative recombinant clones analyzed by restriction mapping.

pET and pMal clones containing the toxin B repeat insert (aa interval 1750-2360 of
toxin B) were verified by restriction mapping, using enzymes that cleaved specific sites within
the toxin B region. In both cases fusion of the toxin B SpeI site with either the compatible
XbaI site (pMal) or compatible NheI site (pET) is predicted to create an in frame fusion. This
was confirmed in the case of the pMB1750-2360 clone by DNA sequencing of the clone
junction and 5' end of the toxin B insert using a MBP specific primer (New England
Biolabs). In the case of the pET construct, the fusion of the blunt ended toxin B 3' end to
the filled HindIII site should create an in-frame fusion with the C-terminal poly-histidine
sequence in this vector. The pPB1750-2360 clone selected had lost, as predicted, the HindIII
site at this clone junction; this eliminated the possibility that an additional adenosine residue
was added to the 3' end of the PCR product by a terminal transferase activity of the Pfu
polymerase, since fusion of this adenosine residue to the filled HindIII site would regenerate
the restriction site (and was observed in several clones).
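
The reasoning about the regenerated HindIII site can be illustrated with a short sketch. It assumes only the known HindIII recognition sequence (AAGCTT, cleaved A^AGCTT), so that the Klenow-filled vector end reads AGCTT on the top strand. The insert end used here is hypothetical and is not taken from the toxin B sequence.

```python
# Illustrative sketch only; the insert sequences below are hypothetical.
HINDIII_SITE = "AAGCTT"      # HindIII recognition sequence (cleaves A^AGCTT)
FILLED_VECTOR_END = "AGCTT"  # top strand of the Klenow-filled vector end


def regenerates_hindiii(insert_3prime_end: str) -> bool:
    """Return True if joining a blunt insert end to the filled vector
    end recreates a HindIII site across the ligation junction."""
    junction = insert_3prime_end.upper() + FILLED_VECTOR_END
    # With this vector end, a site can only form from the last insert
    # base followed by AGCTT, i.e. the last six bases of the junction.
    return junction[-len(HINDIII_SITE):] == HINDIII_SITE


blunt_end = "GGATCC"            # hypothetical insert end, no extra base
with_extra_a = blunt_end + "A"  # same end after non-templated A addition

print(regenerates_hindiii(blunt_end))     # False
print(regenerates_hindiii(with_extra_a))  # True
```

A clone that has lost the site is therefore consistent with a PCR product that carried no added adenosine, while a clone that retains the site is consistent with one that did.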

One liter cultures of each expression construct were grown, and fusion protein purified
by affinity chromatography on either an amylose resin column (pMAL constructs; resin
supplied by New England Biolabs) or Ni-chelate column (pET constructs; resin supplied by
Qiagen or Novagen) as described [Williams et al. (1995), supra]. The integrity and purity of
the fusion proteins were determined by Coomassie staining of SDS-PAGE protein gels as well
as Western blot analysis with either an affinity purified goat polyclonal antiserum (Tech Lab),
or a chicken polyclonal PEG prep, raised against the toxin B protein (CTB) as described
above in Example 8. In both cases, affinity purification resulted in yields in excess of 20 mg
protein per liter culture, of which greater than 90% was estimated to be full-length
recombinant protein. It should be noted that the poly-histidine affinity tagged protein was
released from the Qiagen Ni-NTA resin at low imidazole concentration (60 mM),
necessitating the use of a 40 mM imidazole rather than a 60 mM imidazole wash step during
purification.

A periplasmically secreted version of pMB1750-2360 was constructed by replacement
of the promoter and MBP coding region of this construct with that from a related vector
(pMAL™-p2; New England Biolabs) in which a signal sequence is present at the N-terminus
of the MBP, such that fusion protein is exported. This was accomplished by substituting a
BglII-EcoRV promoter fragment from pMAL-p2 into pMB1750-2360. The yields of secreted,
affinity purified protein [recovered from osmotic shock extracts as described by Riggs in
Current Protocols in Molecular Biology, Vol. 2, Ausubel, et al., Eds. (1989), Current
Protocols, pp. 16.6.1-16.6.14] from this vector (pMBp1750-2360) were 6.5 mg/liter culture,
of which 50% was estimated to be full-length fusion protein.

The interval was also expressed in two non-overlapping fragments. pMB1750-1970
was constructed by introduction of a frameshift into pMB1750-2360, by restriction with
HindIII, filling in the overhanging ends and religation of the plasmid. Recombinant clones
were selected by loss of the HindIII site, and further restriction map analysis. Recombinant
protein expression from this vector was more than 20 mg/liter of greater than 90% pure
protein.
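
Why HindIII digestion, fill-in and religation introduces a frameshift can be shown with a minimal sketch. It assumes only that HindIII cleaves A^AGCTT and leaves a four-base 5' overhang. The example sequence is hypothetical and the helper name is illustrative.

```python
def fill_in_and_religate(seq: str, site: str = "AAGCTT",
                         cut: int = 1, overhang: int = 4) -> str:
    """Model cutting a 5'-overhang site, filling both ends with Klenow,
    and religating the two blunt ends (top strand only)."""
    i = seq.index(site)
    left = seq[: i + cut + overhang]  # filled upstream end: ...AAGCT
    right = seq[i + cut:]             # filled downstream end: AGCTT...
    return left + right


original = "ATGGCCAAGCTTGGA"          # hypothetical reading frame
religated = fill_in_and_religate(original)

print(religated)                       # ATGGCCAAGCTAGCTTGGA
print(len(religated) - len(original))  # 4
print("AAGCTT" in religated)           # False
```

The four-base duplication is not a multiple of three, so codons downstream of the junction are read out of frame, and the HindIII site itself is destroyed. This matches the screen for loss of the site described above.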

The complementary region was expressed in pMB1970-2360. This construct was
created by removal of the 1750-1970 interval of pMB1750-2360. This was accomplished by
restriction of this plasmid with EcoRI (in the pMalc polylinker 5' to the insert) and HindIII,
filling in the overhanging ends, and religation of the plasmid. The resultant plasmid,
pMB1970-2360, was made using both intracellularly and secreted versions of the
pMB1750-2360 vector.

No fusion protein was secreted in the pMBp1970-2360 version, perhaps due to a
conformational constraint that prevents export of the fusion protein. However, the
intracellularly expressed vector produced greater than 40 mg/liter of greater than 90%
full-length fusion protein.

Constructs to precisely express the toxin B repeats in either pMalc (pMB1850-2360) or
pET16b (pPB1850-2360) were constructed as follows. The DNA interval including the toxin
B repeats was PCR amplified as described above utilizing PCR primers P14 (SEQ ID NO:19)
and P8 (SEQ ID NO:14). Primer P14 adds an EcoRI site immediately flanking the start of the
toxin B repeats.

The amplified fragment was cloned into the pT7 Blue T-vector (Novagen) and
recombinant clones in which single copies of the PCR fragment were inserted in either
orientation were selected (pT71850-2360) and confirmed by restriction mapping. The insert
was excised from two appropriately oriented independently isolated pT71850-2360 plasmids,
with EcoRI (5' end of repeats) and PstI (in the flanking polylinker of the vector), and cloned
into EcoRI/PstI cleaved pMalc vector. The resulting construct (pMB1850-2360) was
confirmed by restriction analysis, and yielded 20 mg/l of soluble fusion protein [greater than
90% intact (i.e., nondegraded)] after affinity chromatography. Restriction of this plasmid with
HindIII and religation of the vector resulted in the removal of the 1970-2360 interval. The
resultant construct (pMB1850-1970) expressed greater than 70 mg/liter of 90% full length
fusion protein.

The pPB1850-2360 construct was made by cloning an EcoRI (filled with Klenow)-
BamHI fragment from a pT71850-2360 vector (opposite orientation to that used in the
pMB1850-2360 construction) into NdeI (filled)/BamHI cleaved pET16b vector. Yields of
affinity purified soluble fusion protein were 15 mg/liter, of greater than 90% full length
fusion protein.

Several smaller expression constructs from the 1750-2070 interval were also
constructed in His-tagged pET vectors, but expression of these plasmids in the BL21(DE3)
host resulted in the production of high levels of mostly insoluble protein (see Table 23 and
Figure 19). These constructs were made as follows.

pPB1850-1970 was constructed by cloning a BglII-HindIII fragment of pPB1850-2360
into BglII/HindIII cleaved pET23b vector. pPB1850-2070 was constructed by cloning a
BglII-PvuII fragment of pPB1850-2360 into BglII/HincII cleaved pET23b vector. pPB1750-
1970(c) was constructed by removal of the internal HindIII fragment of a pPB1750-2360
vector in which the vector HindIII site was regenerated during cloning (presumably by the
addition of an A residue to the amplified PCR product by terminal transferase activity of the
polymerase). The pPB1750-1970(n) construct was made by insertion of the insert containing
the NdeI-HindIII fragment of pPB1750-2360 into identically cleaved pETHisb vector. All
constructs were confirmed by restriction digestion.

An expression construct that directs expression of the 10-470 aa interval of toxin B
was constructed in the pMalc vector as follows. A NheI (a site 5' to the insert in the pET23
vector)-AflII (filled) fragment of the toxin B gene from pPB10-1530 was cloned into XbaI
(compatible with NheI)/HindIII (filled) pMalc vector. The integrity of the construct (pMB10-
470) was verified by restriction mapping and DNA sequencing of the 5' clone junction using
a MBP specific DNA primer (New England Biolabs). However, all expressed protein was
degraded to the MBP monomer MW.

A second construct spanning this interval (aa 10-470) was constructed by cloning the
PCR amplification product from a reaction containing the P9 (SEQ ID NO:15) and P10 (SEQ
ID NO:16) primers (Figure 18) into the pETHisa vector. This was accomplished by cloning
the PCR product as an EcoRI-blunt fragment into EcoRI-HincII restricted vector DNA;
recombinant clones were verified by restriction mapping. Although this construct (pPB10-
520) allowed expression and purification (utilizing the N-terminal polyhistidine affinity tag) of
intact fusion protein, yields were estimated at less than 500 µg per liter culture.

Higher yields of recombinant protein from this interval (aa 10-520) were obtained by
expression of the interval in two overlapping clones. The 10-330 aa interval was cloned in
both pETHisa and pMalc vectors, using the BamHI-AflIII (filled) DNA fragment from pPB10-
520. This fragment was cloned into BamHI-HindIII (filled) restricted pMalc or BamHI-HincII
restricted pETHisa vector. Recombinant clones were verified by restriction mapping. High
level expression of either insoluble (pET) or soluble (pMal) fusion protein was obtained. Total
yields of fusion protein from the pMB10-330 construct (Figure 18) were 20 mg/liter culture,
of which 10% was estimated to be full-length fusion protein. Although yields of this interval
were higher and >90% full-length recombinant protein was produced when expressed from the
pET construct, the pMal fusion was utilized since the expressed protein was soluble and thus
more likely to have the native conformation.

The pMB260-520 clone was constructed by cloning EcoRI-XbaI cleaved amplified
DNA from a PCR reaction containing the P11 (SEQ ID NO:17) and P10 (SEQ ID NO:16)
DNA primers (Figure 18) into similarly restricted pMalc vector. Yields of affinity purified
protein were 10 mg/liter, of which approximately 50% was estimated to be full-length
recombinant protein.

The aa 510-1110 interval was expressed as described below. This entire interval was
expressed as a pMal fusion by cloning the NheI-HindIII fragment of pUCB10-1530 into XbaI-
HindIII cleaved pMalc vector. The integrity of the construct (pMB510-1110) was verified by
restriction mapping and DNA sequencing of the 5' clone junction using a MBP specific DNA
primer. The yield of affinity purified protein was 25 mg/liter culture, of which 5% was
estimated to be full-length fusion protein (1 mg/liter).
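
The full-length figures quoted for each construct follow from multiplying the total affinity purified yield by the estimated full-length fraction. The sketch below works this through for three constructs, using the yields and percentages given in the text above. The function name is illustrative only.

```python
def full_length_yield(total_mg_per_liter: float,
                      percent_full_length: float) -> float:
    """Full-length fusion protein (mg/liter) from total yield and the
    estimated percentage of full-length product."""
    return total_mg_per_liter * percent_full_length / 100.0


# (total yield in mg/liter, estimated % full length), as stated in the text
constructs = {
    "pMB10-330": (20.0, 10.0),
    "pMB260-520": (10.0, 50.0),
    "pMB510-1110": (25.0, 5.0),
}

for name, (total, percent) in constructs.items():
    print(name, full_length_yield(total, percent), "mg/liter full length")
```

For pMB510-1110 this gives 1.25 mg/liter, in line with the approximately 1 mg/liter stated above.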

To attempt to obtain higher yields, this region was expressed in two fragments (aa 510-
820, and 820-1110) in the pMalc vector. The pMB510-820 clone was constructed by
insertion of a SacI (in the pMalc polylinker 5' to the insert)-HpaI DNA fragment from
pMB510-1110 into SacI/StuI restricted pMalc vector. The pMB820-1110 vector was
constructed by insertion of the HpaI-HindIII fragment of pUCB10-1530 into BamHI (filled)/
HindIII cleaved pMalc vector. The integrity of these constructs was verified by restriction
mapping and DNA sequencing of the 5' clone junction using a MBP specific DNA primer.

Recombinant protein expressed from the pMB510-820 vector was highly unstable.
However, high levels (20 mg/liter) of >90% full-length fusion protein were obtained from the
pMB820-1105 construct. The combination of partially degraded pMB510-1110 protein
(enriched for the 510-820 interval) with the pMB820-1110 protein provides usable amounts of
recombinant antigen from this interval.

The aa 1100-1750 interval was expressed as described below. The entire interval was
expressed in the pMalc vector from a construct in which the AccI (filled)-SpeI fragment of
pPB10-1750 was inserted into StuI/XbaI (XbaI is compatible with SpeI; StuI and filled AccI
sites are both blunt ended) restricted pMalc. The integrity of this construct (pMB1100-1750)
was verified by restriction mapping and DNA sequencing of the clone junction using a MBP
specific DNA primer. Although 15 mg/liter of affinity purified protein was isolated from cells
harboring this construct, the protein was greater than 99% degraded to MBP monomer size.

A smaller derivative of pMB1100-1750 was constructed by restriction of pMB1100-
1750 with AflII and SalI (in the pMalc polylinker 3' to the insert), filling in the overhanging
ends, and religating the plasmid. The resultant clone (verified by restriction digestion and
DNA sequencing) has deleted the aa 1530-1750 interval and was designated pMB1100-1530.
pMB1100-1530 expressed recombinant protein at a yield of greater than 40 mg/liter, of which
5% was estimated to be full-length fusion protein.
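
The end compatibilities relied on in these constructions (XbaI with SpeI or NheI; XhoI with SalI; blunt with blunt) can be checked from the recognition sequences. The sketch below assumes the standard recognition sites and top-strand cut positions of these enzymes. The dictionary and function names are illustrative only.

```python
# Recognition site and top-strand cut offset for each enzyme.
# All of these are palindromic sites; StuI cuts in the middle (blunt).
ENZYMES = {
    "XbaI": ("TCTAGA", 1),
    "SpeI": ("ACTAGT", 1),
    "NheI": ("GCTAGC", 1),
    "XhoI": ("CTCGAG", 1),
    "SalI": ("GTCGAC", 1),
    "EcoRI": ("GAATTC", 1),
    "HindIII": ("AAGCTT", 1),
    "StuI": ("AGGCCT", 3),
}


def overhang(enzyme: str) -> str:
    """Single-stranded 5' overhang left by the enzyme ('' if blunt)."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def compatible(a: str, b: str) -> bool:
    """Ends can be ligated if their overhangs are identical."""
    return overhang(a) == overhang(b)


print(overhang("XbaI"), overhang("SpeI"), overhang("NheI"))  # CTAG CTAG CTAG
print(compatible("XhoI", "SalI"))      # True
print(compatible("EcoRI", "HindIII"))  # False
print(repr(overhang("StuI")))          # ''
```

An end that has been filled in with Klenow behaves like the StuI case: it has no overhang and can be joined to any other blunt end.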

Three constructs were made to express the remaining interval. Initially, a BspHI
(filled)-SpeI fragment from pPB10-1750 was cloned into EcoRI (filled)/XbaI cleaved pMalc
vector. The integrity of this construct (pMB1570-1750) was verified by restriction mapping
and DNA sequencing of the 5' clone junction using a MBP specific DNA primer. Expression
of recombinant protein from this plasmid was very low, approximately 3 mg affinity purified
protein per liter, and most was degraded to MBP monomer size. This region was
subsequently expressed from a PCR amplified DNA fragment. A PCR reaction utilizing
primers P13 [SEQ ID NO:18; P13 was engineered to introduce an EcoRI site 5' to amplified
toxin B sequences] and P8 (SEQ ID NO:14) was performed on C. difficile genomic DNA as
described above. The amplified fragment was cleaved with EcoRI and SpeI, and cloned into
EcoRI/XbaI cleaved pMalc vector. The resultant clone (pMB1530-1750) was verified by
restriction map analysis, and recombinant protein was expressed and purified. The yield was
greater than 20 mg protein per liter culture and it was estimated that 25% was full-length
fusion protein; this was a significantly higher yield than the original pMB1570-1750 clone.
The insert of pMB1530-1750 (in an EcoRI-SalI fragment) was transferred to the pETHisa
vector (EcoRI/XhoI cleaved; XhoI and SalI ends are compatible). No detectable fusion
protein was purified on Ni-Chelate columns from soluble lysates of cells induced to express
fusion protein from this construct.

TABLE 23

Summary Of Toxin B Expression Constructs(a)

Clone               Affinity Tag   Yield (mg/liter)                          % Full Length

pPB10-1750          none           low (estimated from Western blot hyb.)    ?
pPB10-1530          none           low (as above)                            ?
pMB10-470           MBP            15 mg/l                                   0%
pPB10-520           poly-his       0.5 mg/l                                  20%
pPB10-330           poly-his       >20 mg/l (insoluble)                      90%
pMB10-330           MBP            20 mg/l                                   10%
pMB260-520          MBP            10 mg/l                                   50%
pMB510-1110         MBP            25 mg/l                                   5%
pMB510-820          MBP            degraded (by Western blot hyb.)
pMB820-1110         MBP            20 mg/l                                   >90%
pMB1100-1750        MBP            15 mg/l                                   0%
pMB1100-1530        MBP            >40 mg/l                                  5%
pMB1570-1750        MBP            3 mg/l
pPB1530-1750        poly-his       no purified protein detected              ?
pMB1530-1750        MBP            >20 mg/l                                  25%
pMB1750-2360        MBP            >20 mg/l                                  >90%
pMBp1750-2360       MBP            6.5 mg/l (secreted)                       50%
pPB1750-2360        poly-his       >20 mg/l                                  >90%
pMB1750-1970        MBP            >20 mg/l                                  >90%
pMB1970-2360        MBP            40 mg/l                                   90%
pMBp1970-2360       MBP            (no secretion)                            NA
pMB1850-2360        MBP            20 mg/l                                   90%
pPB1850-2360        poly-his       15 mg/l                                   90%
pMB1850-1970        MBP            70 mg/l                                   90%
pPB1850-1970        poly-his       10 mg/l (insoluble)                       90%
pPB1850-2070        poly-his       >10 mg/l (insoluble)                      90%
pPB1750-1970(c)     poly-his       10 mg/l (insoluble)                       >90%
pPB1750-1970(n)     poly-his       >10 mg/l (insoluble)                      90%

(a) Clones in italics are clones currently utilized to purify recombinant protein from each
selected interval.

occurs with the CTB antibody-recombinant mixture, that recombinant contains only weak or
non-neutralizing epitopes of toxin B. This assay was performed as follows.

Antibodies against CTB were generated in egg laying Leghorn hens as described in
Example 8. The lethal dosage (LD100) of C. difficile toxin B when delivered I.P. into 40 g
female Golden Syrian hamsters (Charles River) was determined to be 2.5 to 5 µg. Antibodies
generated against CTB and purified by PEG precipitation could completely protect the
hamsters at the I.P. dosage determined above. The minimal amount of CTB antibody needed
to afford good protection against 5 µg of CTB when injected I.P. into hamsters was also
determined (1X PEG prep). These experiments defined the parameters needed to test whether
a given recombinant protein could deplete protective CTB antibodies.

The cloned regions tested for neutralizing ability cover the entire toxin B gene and
were designated as Intervals (INT) 1 through 5 (see Figure 19). Approximately equivalent
final concentrations of each recombinant polypeptide were tested. The following recombinant
polypeptides were used: 1) a mixture of Intervals 1 and 2 (INT-1, 2); 2) a mixture of
Intervals 4 and 5 (INT-4, 5) and 3) Interval 3 (INT-3). Recombinant proteins (each at about
1 mg total protein) were first preincubated with a final CTB antibody concentration of 1X
[i.e., pellet dissolved in original yolk volume as described in Example 1(c)] in a final volume
of 5 mls for 1 hour at 37°C. Twenty-five µg of CTB (at a concentration of 5 µg/ml), enough
CTB to kill 5 hamsters, was then added and the mixture was then incubated for 1 hour at
37°C. Five, 40 g female hamsters (Charles River) in each treatment group were then each
given 1 ml of the mixture I.P. using a tuberculin syringe with a 27 gauge needle. The results
of this experiment are shown in Table 24.
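
The dosing arithmetic in this protocol can be checked with a short sketch. It uses only the quantities stated above (25 µg of CTB in a 5 ml mixture, 1 ml injected per animal, and a lethal dose of 2.5 to 5 µg). The function and variable names are illustrative.

```python
def dose_per_animal_ug(total_toxin_ug: float, final_volume_ml: float,
                       injected_ml: float) -> float:
    """Toxin received by one animal from a well-mixed preparation."""
    return total_toxin_ug / final_volume_ml * injected_ml


TOTAL_CTB_UG = 25.0     # toxin added to each mixture
FINAL_VOLUME_ML = 5.0   # final volume of the antibody/protein/toxin mixture
INJECTED_ML = 1.0       # volume given I.P. to each hamster
LETHAL_DOSE_UG = (2.5, 5.0)  # lethal dose range determined for 40 g hamsters

dose = dose_per_animal_ug(TOTAL_CTB_UG, FINAL_VOLUME_ML, INJECTED_ML)
animals = int(FINAL_VOLUME_ML // INJECTED_ML)

print(dose)                       # 5.0
print(animals)                    # 5
print(dose >= LETHAL_DOSE_UG[1])  # True
```

Each of the five animals in a group therefore receives 5 µg of toxin, the upper end of the lethal range, so survival depends on whether protective antibody remains in the mixture.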



TABLE 24

Binding Of Neutralizing Antibodies By INT 3 Protein

Treatment Group(a)            Number Of Animals Alive    Number Of Animals Dead

CTB antibodies                3                          2
CTB antibodies + INT 1, 2     3                          2
CTB antibodies + INT 4, 5     3                          2
CTB antibodies + INT 3        0                          5

(a) C. difficile toxin B (CTB) was added to each group.

As shown in Table 24, the addition of recombinant proteins from INT-1, 2 or INT-4, 5
had no effect on the in vivo protective ability of the CTB antibody preparation compared to

EXAMPLE 19

Identification, Purification And Induction Of Neutralizing
Antibodies Against Recombinant C. difficile Toxin B Protein



To determine whether recombinant toxin B polypeptide fragments can generate
neutralizing antibodies, typically animals would first be immunized with recombinant proteins
and anti-recombinant antibodies are generated. These anti-recombinant protein antibodies are
then tested for neutralizing ability in vivo or in vitro. Depending on the immunogenic nature
of the recombinant polypeptide, the generation of high-titer antibodies against that protein
may take several months. To accelerate this process and identify which recombinant
polypeptide(s) may be the best candidate to generate neutralizing antibodies, depletion studies
were performed. Specifically, recombinant toxin B polypeptides were pre-screened by testing
whether they have the ability to bind to protective antibodies from a CTB antibody
preparation and hence deplete those antibodies of their neutralizing capacity. Those
recombinant polypeptides found to bind the CTB antibodies were then utilized to generate
neutralizing antibodies. This Example involved: a) identification of recombinant sub-regions
within toxin B to which neutralizing antibodies bind; b) identification of toxin B sub-region
specific antibodies that neutralize toxin B in vivo; and c) generation and evaluation of
antibodies reactive to recombinant toxin B polypeptides.


a) Identification Of Recombinant Sub-Regions Within Toxin B 
To Which Neutralizing Antibodies Bind

Sub-regions within toxin B to which neutralizing antibodies bind were identified by
utilizing recombinant toxin B proteins to deplete protective antibodies from a polyclonal pool
of antibodies against native C. difficile toxin B. An in vivo assay was developed to evaluate
protein preparations for the ability to bind neutralizing antibodies. Recombinant proteins were
first pre-mixed with antibodies directed against native toxin B (CTB antibody; see Example 8)
and allowed to react for one hour at 37°C. Subsequently, C. difficile toxin B (CTB; Tech
Lab) was added at a concentration lethal to hamsters and incubated for another hour at 37°C.
After incubation this mixture was injected intraperitoneally (IP) into hamsters. If the
recombinant polypeptide contains neutralizing epitopes, the CTB antibodies will lose their
ability to protect the hamsters against death from CTB. If partial or complete protection

the CTB antibody preparation alone. In contrast, INT-3 recombinant polypeptide was able to
remove all of the toxin B neutralizing ability of the CTB antibodies as demonstrated by the
death of all the hamsters in that group.

The above experiment was repeated, using two smaller expressed fragments (pMB1750-
1970 and pMB1970-2360, see Figure 19) comprising the INT-3 domain to determine if
that domain could be further subdivided into smaller neutralizing epitopes. In addition, full-
length INT-3 polypeptide expressed as a nickel tagged protein (pPB1750-2360) was tested for
neutralizing ability and compared to the original INT-3 expressed MBP fusion (pMB1750-
2360). The results are shown in Table 25.

TABLE 25

Removal Of Neutralizing Antibodies By Repeat Containing Proteins

Treatment Group(a)                Number Of Animals Alive    Number Of Animals Dead

CTB antibodies                    5                          0
CTB antibodies + pPB1750-2360     0                          5
CTB antibodies + pMB1750-2360     0                          5
CTB antibodies + pMB1970-2360     3                          2
CTB antibodies + pMB1750-1970     2                          3

(a) C. difficile toxin B (CTB) was added to each group.

The results summarized in Table 25 indicate that the smaller polypeptide fragments
within the INT-3 domain, pMB1750-1970 and pMB1970-2360, partially lose the ability to
bind to and remove neutralizing antibodies from the CTB antibody pool. These results
demonstrate that the full length INT-3 polypeptide is required to completely deplete the CTB
antibody pool of neutralizing antibodies. This experiment also shows that the neutralization
epitope of INT-3 can be expressed in alternative vector systems and the results are
independent of the vector utilized or the accompanying fusion partner.
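
The way survival counts are read in this depletion assay can be summarized in a short sketch. The survivor numbers are those legible in Table 25. The three labels ("complete", "partial", "none") and the function name are illustrative assumptions, not terms defined in this Example.

```python
def classify_depletion(alive_with_protein: int,
                       alive_antibody_only: int) -> str:
    """Interpret survivors in a group that received antibody pre-mixed
    with recombinant protein, relative to the antibody-only control."""
    if alive_with_protein == 0:
        return "complete"  # all protective antibody was removed
    if alive_with_protein < alive_antibody_only:
        return "partial"   # some protective antibody was removed
    return "none"          # protection unchanged


CONTROL_ALIVE = 5  # CTB antibodies alone (Table 25)
survivors = {
    "pPB1750-2360": 0,
    "pMB1750-2360": 0,
    "pMB1970-2360": 3,
    "pMB1750-1970": 2,
}

for protein, alive in survivors.items():
    print(protein, classify_depletion(alive, CONTROL_ALIVE))
```

With these counts, both full-length INT-3 proteins are classed as complete depletion and the two sub-fragments as partial, which is the pattern described in the text.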

Other Interval 3 specific proteins were subsequently tested for the ability to remove
neutralizing antibodies within the CTB antibody pool as described above. The Interval 3
specific proteins used in these studies are summarized in Figure 23. In Figure 23 the
following abbreviations are used: pP refers to the pET23 vector; pM refers to the pMALc
vector; B refers to toxin B; the numbers refer to the amino acid interval expressed in the
clone. The solid black ovals represent the MBP; and HHH represents the poly-histidine tag.

Only recombinant proteins comprising the entire toxin B repeat domain (pMB1750-
2360, pPB1750-2360 and pPB1850-2360) can bind and completely remove neutralizing
antibodies from the CTB antibody pool. Recombinant proteins comprising only a portion of
the toxin B repeat domain were not capable of completely removing neutralizing antibodies
from the CTB antibody pool (pMB1750-1970 and pMB1970-2360 could partially remove
neutralizing antibodies while pMB1850-1970 and pPB1850-2070 failed to remove any
neutralizing antibodies from the CTB antibody pool).

The above results demonstrate that only the complete ligand binding domain (repeat
region) of the toxin B gene can bind and completely remove neutralizing antibodies from the
CTB antibody pool. These results demonstrate that antibodies directed against the entire toxin
B repeat region are necessary for in vivo toxin neutralization (see Figure 23; only the
recombinant proteins expressed by the pMB1750-2360, pPB1750-2360 and pPB1850-2360
vectors are capable of completely removing the neutralizing antibodies from the CTB
antibody pool).

These results represent the first indication that the entire repeat region of toxin B
would be necessary for the generation of antibodies capable of neutralizing toxin B, and that
sub-regions may not be sufficient to generate maximal titers of neutralizing antibodies.

b) Identification Of Toxin B Sub-Region Specific Antibodies
That Neutralize Toxin B In Vivo

To determine if antibodies directed against the toxin B repeat region are sufficient for
neutralization, region specific antibodies within the CTB antibody preparation were affinity
purified, and tested for in vivo neutralization. Affinity columns containing recombinant toxin
B repeat proteins were made as described below. A separate affinity column was prepared
using each of the following recombinant toxin B repeat proteins: pPB1750-2360, pPB1850-
2360, pMB1750-1970 and pMB1970-2360.

For each affinity column to be made, four ml of PBS-washed Actigel resin (Sterogene)
was coupled overnight at room temperature with 5-10 mg of affinity purified recombinant
protein (first extensively dialyzed into PBS) in 15 ml tubes (Falcon) containing a 1/10 final
volume Ald-coupling solution (1 M sodium cyanoborohydride). Aliquots of the supernatants
from the coupling reactions, before and after coupling, were assessed by Coomassie staining
of 7.5% SDS-PAGE gels. Based on protein band intensities, in all cases greater than 30%
coupling efficiencies were estimated. The resins were poured into 10 ml columns (BioRad),
washed extensively with PBS, pre-eluted with 4 M guanidine-HCl (in 10 mM Tris-HCl, pH
8.0) and reequilibrated in PBS. The columns were stored at 4°C.

Aliquots of a CTB IgY polyclonal antibody preparation (PEG prep) were affinity
purified on each of the four columns as described below. The columns were hooked to a UV
monitor (ISCO), washed with PBS and 40 ml aliquots of a 2X PEG prep (filter sterilized
using a 0.45 µ filter) were applied. The columns were washed with PBS until the baseline
value was re-established. The columns were then washed with BBS-Tween to elute
nonspecifically binding antibodies, and reequilibrated with PBS. Bound antibody was eluted
from the column in 4 M guanidine-HCl (in 10 mM Tris-HCl, pH 8.0). The eluted antibody was
immediately dialyzed against a 100-fold excess of PBS at 4°C for 2 hrs. The samples were
then dialyzed extensively against at least 2 changes of PBS, and affinity purified antibody was
collected and stored at 4°C. The antibody preparations were quantified by UV absorbance.
The elution volumes were in the range of 4-8 ml. All affinity purified stocks contained
similar total antibody concentrations, ranging from 0.25-0.35% of the total protein applied to
the columns.

The ability of the affinity purified antibody preparations to neutralize toxin B in vivo
was determined using the assay outlined in a) above. Affinity purified antibody was diluted
1:1 in PBS before testing. The results are shown in Table 26.

In all cases similar levels of toxin neutralization were observed, such that lethality was
delayed in all groups relative to preimmune controls. This result demonstrates that antibodies
reactive to the repeat region of the toxin B gene are sufficient to neutralize toxin B in vivo.
The hamsters will eventually die in all groups, but this death is maximally delayed with the
CTB PEG prep antibodies. Thus neutralization with the affinity purified (AP) antibodies is
not as complete as that observed with the CTB prep before affinity chromatography. This
result may be due to loss of activity during guanidine denaturation (during the elution of the
antibodies from the affinity column) or the presence of antibodies specific to other regions of
the toxin B gene that can contribute to toxin neutralization (present in the CTB PEG prep).

TABLE 26

Neutralization Of Toxin B By Affinity Purified Antibodies

Treatment group(a)                          Number Animals Alive(b)    Number Animals Dead(b)

Preimmune(1)                                0                          5
CTB(1); 400 µg                              5                          0
CTB (AP on pPB1750-2360)(2); 875 µg         5                          0
CTB (AP on pMB1750-1970)(2); 875 µg         5                          0
CTB (AP on pMB1970-2360)(2); 500 µg         5                          0

(a) C. difficile toxin B (CTB) (Tech Lab; at 5 µg/ml, 25 µg total) at lethal concentration to
hamsters is added to antibody and incubated for one hour at 37°C. After incubation this
mixture is injected intraperitoneally (IP) into hamsters. Each treatment group received toxin
premixed with antibody raised against the indicated protein, as either: (1) 4X antibody PEG
prep or (2) affinity purified (AP) antibody (from CTB PEG prep, on indicated columns). The
amount of specific antibody in each prep is indicated; the amount is directly determined for
affinity purified preps and is estimated for the 4X CTB as described in Example 15.

(b) The numbers in each group represent numbers of hamsters dead or alive, 2 hr post IP
administration of toxin/antibody mixture.

The observation that antibodies affinity purified against the non-overlapping
pMB1750-1970 and pMB1970-2360 proteins neutralized toxin B raised the possibility that
either 1) antibodies specific to repeat sub-regions are sufficient to neutralize toxin B or 2)
sub-region specific proteins can bind most or all repeat specific antibodies present in the CTB
polyclonal pool. This would likely be due to conformational similarities between repeats,
since homology in the primary amino acid sequences between different repeats is in the range
of only 25-75% [Eichel-Streiber, et al. (1992) Molec. Gen. Genetics 233:260]. These
possibilities were tested by affinity chromatography.

The CTB PEG prep was sequentially depleted 2X on the pMB1750-1970 column; only
a small elution peak was observed after the second chromatography, indicating that most
reactive antibodies were removed. This interval depleted CTB preparation was then
chromatographed on the pPB1850-2360 column; no antibody bound to the column. The
reactivity of the CTB and CTB (pMB1750-1970 depleted) preps to pPB1750-2360, pPB1850-
2360, pMB1750-1970 and pMB1970-2360 proteins was then determined by ELISA using the
protocol described in Example 13(c). Briefly, 96-well microtiter plates (Falcon, Pro-Bind
Assay Plates) were coated with recombinant protein by adding 100 µl volumes of protein at
1-2 µg/ml in PBS containing 0.005% thimerosal to each well and incubating overnight at 4°C.
The next morning, the coating suspensions were decanted and the wells were washed three
times using PBS. In order to block non-specific binding sites, 100 µl of 1.0% BSA (Sigma)
in PBS (blocking solution) was then added to each well, and the plates were incubated for 1
hr at 37°C. The blocking solution was decanted and duplicate samples of 150 µl of diluted
antibody were added to the first well of a dilution series. The initial testing serum dilution
was 1/200 for the CTB prep (the concentration of depleted CTB was standardized by OD280) in
blocking solution containing 0.5% Tween 20, followed by 5-fold serial dilutions into this
solution. This was accomplished by serially transferring 30 µl aliquots to 120 µl buffer,
mixing, and repeating the dilution into a fresh well. After the final dilution, 30 µl was
removed from the well such that all wells contained 120 µl final volume. A total of 3 such
dilutions were performed (4 wells total). The plates were incubated for 1 hr at 37°C.
Following this incubation, the serially diluted samples were decanted and the wells were
washed three times using PBS containing 0.5% Tween 20 (PBST), followed by two 5 min
washes using BBS-Tween and a final three washes using PBST. To each well, 100 µl of
1/1000 diluted secondary antibody [rabbit anti-chicken IgG alkaline phosphatase (Sigma)
diluted in blocking solution containing 0.5% Tween 20] was added, and the plate was
incubated 1 hr at 37°C. The conjugate solutions were decanted and the plates were washed 6
times in PBST, then once in 50 mM Na2CO3, 10 mM MgCl2, pH 9.5. The plates were
developed by the addition of 100 µl of a solution containing 1 mg/ml para-nitro phenyl
phosphate (Sigma) dissolved in 50 mM Na2CO3, 10 mM MgCl2, pH 9.5 to each well. The
plates were then incubated at room temperature in the dark for 5-45 min. The absorbency of
each well was measured at 410 nm using a Dynatech MR 700 plate reader.
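
The dilution scheme used in this ELISA (a 1/200 starting dilution, then 30 µl transferred into 120 µl of buffer at each step, across four wells) can be worked through with a short sketch. The function name is illustrative. Exact fractions are used so that the dilutions print without rounding.

```python
from fractions import Fraction


def dilution_series(initial: Fraction, transfer_ul: int,
                    diluent_ul: int, wells: int) -> list:
    """Antibody dilution in each well of a serial dilution row."""
    # Each transfer dilutes by transfer / (transfer + diluent).
    step = Fraction(transfer_ul, transfer_ul + diluent_ul)
    series = [initial]
    for _ in range(wells - 1):
        series.append(series[-1] * step)
    return series


series = dilution_series(Fraction(1, 200), 30, 120, 4)
print([str(d) for d in series])
# ['1/200', '1/1000', '1/5000', '1/25000']
```

Transferring 30 µl into 120 µl gives a total of 150 µl, which is the stated 5-fold step, so the four wells span a 125-fold range of antibody concentration.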

As predicted by the affinity chromatography results, depletion of the CTB prep on the
pMB1750-1970 column removed all detectable reactivity to the pMB1970-2360 protein. The
reciprocal purification of a CTB prep that was depleted on the pMB1970-2360 column
yielded no bound antibody when chromatographed on the pMB1750-1970 column. These
results demonstrate that all repeat reactive antibodies in the CTB polyclonal pool recognize a
conserved structure that is present in non-overlapping repeats. Although it is possible that this
conserved structure represents rare conserved linear epitopes, it appears more likely that the
neutralizing antibodies recognize a specific protein conformation. This conclusion was also
suggested by the results of Western blot hybridization analysis of CTB reactivity to these
recombinant proteins.

Western blots of 7.5% SDS-PAGE gels, loaded and electrophoresed with defined
quantities of each recombinant protein, were probed with the CTB polyclonal antibody
preparation. The blots were prepared and developed with alkaline phosphatase as described in
Example 3. The results are shown in Figure 24.

Figure 24 depicts a comparison of immunoreactivity of IgY antibody raised against
either native or recombinant toxin B antigen. Equal amounts of pMB1750-1970 (lane 1),
pMB1970-2360 (lane 2), pPB1850-2360 (lane 3) as well as a serial dilution of pPB1750-2360
(lanes 4-6 comprising 1X, 1/10X and 1/100X amounts, respectively) proteins were loaded in
duplicate and resolved on a 7.5% SDS-PAGE gel. The gel was blotted and each half was
hybridized with PEG prep IgY antibodies from chickens immunized with either native CTB or
pPB1750-2360 protein. Note that the full-length pMB1750-1970 protein was identified only
by antibodies reactive to the recombinant protein (arrows).

Although the CTB prep reacts with the pPB1750-2360, pPB1850-2360, and pMB1970-2360
proteins, no reactivity to the pMB1750-1970 protein was observed (Figure 24). Given
that all repeat reactive antibodies can be bound by this protein during affinity
chromatography, this result indicates that the protein cannot fold properly on Western blots.
Since this eliminates all antibody reactivity, it is unlikely that the repeat reactive antibodies in
the CTB prep recognize linear epitopes. This may indicate that in order to induce protective
antibodies, recombinant toxin B protein will need to be properly folded.

e) Generation And Evaluation Of Antibodies Reactive To
Recombinant Toxin B Polypeptides

i) Generation Of Antibodies Reactive To
Recombinant Toxin B Proteins

Antibodies against recombinant proteins were generated in egg laying Leghorn hens as
described in Example 13. Antibodies were raised [using Freunds adjuvant (Gibco) unless
otherwise indicated] against the following recombinant proteins: 1) a mixture of Interval 1+2
proteins (see Figure 18); 2) a mixture of Interval 4 and 5 proteins (see Figure 18); 3)
pMB1970-2360 protein; 4) pPB1750-2360 protein; 5) pMB1750-2360; 6) pMB1750-2360
[Titermax adjuvant (Vaxcell)]; 7) pMB1750-2360 [Gerbu adjuvant (Biotech)]; 8) pMBp1750-2360
protein; 9) pPB1850-2360; and 10) pMB1850-2360.

Chickens were boosted at least 3 times with recombinant protein until ELISA
reactivity [using the protocol described in b) above with the exception that the plates were
coated with pPB1750-2360 protein] of polyclonal PEG preps was at least equal to that of the
CTB polyclonal antibody PEG prep. ELISA titers were determined for the PEG preps from
all of the above immunogens and were found to be comparable, ranging from 1:12500 to
1:62500. High titers were achieved in all cases except in 6) pMB1750-2360, in which strong
titers were not observed using the Titermax adjuvant, and this preparation was not tested
further.

ii) Evaluation Of Antibodies Reactive To
Recombinant Proteins By Western Blot
Hybridization

Western blots of 7.5% SDS-PAGE gels, loaded and electrophoresed with defined
quantities of recombinant protein (pMB1750-1970, pPB1850-2360, and pMB1970-2360
proteins and a serial dilution of the pPB1750-2360 to allow quantification of reactivity), were
probed with the CTB, pPB1750-2360, pMB1750-2360 and pMB1970-2360 polyclonal
antibody preparations (from chickens immunized using Freunds adjuvant). The blots were
prepared and developed with alkaline phosphatase as described above in b).

As shown in Figure 24, the CTB and pMB1970-2360 preps reacted strongly with the
pPB1750-2360, pPB1850-2360, and pMB1970-2360 proteins while the pPB1750-2360 and
pMB1970-2360 (Gerbu) preparations reacted strongly with all four proteins. The Western
blot reactivity of the pPB1750-2360 and pMB1970-2360 (Gerbu) preparations was equivalent
to that of the CTB preparation, while reactivity of the pMB1970-2360 preparation was <10%
that of the CTB prep. Despite equivalent ELISA reactivities, only weak reactivity
(approximately 1%) to the recombinant proteins was observed in PEG preps from two
independent groups immunized with the pMB1750-2360 protein and one group immunized
with the pMB1750-2360 preparation using Freunds adjuvant.

Affinity purification was utilized to determine if this difference in immunoreactivity by
Western blot analysis reflects differing antibody titers. Fifty ml 2X PEG preparations from
chickens immunized with either pMB1750-2360 or pMB1970-2360 protein were
chromatographed on the pPB1750-2360 affinity column from b) above, as described. The
yield of affinity purified antibody (% total protein in preparation) was equivalent to the yield
obtained from a CTB PEG preparation in b) above. Thus, differences in Western reactivity
reflect a qualitative difference in the antibody pools, rather than quantitative differences.
These results demonstrate that certain recombinant proteins are more effective at generating
high affinity antibodies (as assayed by Western blot hybridization).
iii) In Vivo Neutralization Of Toxin B Using
Antibodies Reactive To Recombinant Protein

The in vivo hamster model [described in Examples 9 and 14(b)] was utilized to assess
the neutralizing ability of antibodies raised against recombinant toxin B proteins. The results
from three experiments are shown below in Tables 27-29.

The ability of each immunogen to neutralize toxin B in vivo has been compiled and is
shown in Table 30. As predicted from the recombinant protein-CTB premix studies (Table
24) only antibodies to Interval 3 (1750-2366) and not the other regions of toxin B (i.e.,
intervals 1-5) are protective. Unexpectedly, antibodies generated to INT-3 region expressed in
pMAL vector (pMB1750-2360 and pMpB1750-2360) using Freunds adjuvant were
non-neutralizing. This observation is reproducible, since no neutralization was observed in two
independent immunizations with pMB1750-2360 and one immunization with pMpB1750-2360.
The fact that 5X quantities of affinity purified toxin B repeat specific antibodies from
pMB1750-2360 PEG preps cannot neutralize toxin B while 1X quantities of affinity purified
anti-CTB antibodies can (Table 28) demonstrates that the differential ability of CTB
antibodies to neutralize toxin B is due to qualitative rather than quantitative differences in
these antibody preparations. Only when this region was expressed in an alternative vector
(pPB1750-2360) or using an alternative adjuvant with the pMB1750-2360 protein were
neutralizing antibodies generated. Importantly, antibodies raised using Freunds adjuvant to
pPB1850-2360, which contains a fragment that is only 100 amino acids smaller than
recombinant pPB1750-2360, are unable to neutralize toxin B in vivo (Table 27); note also that
the same vector is used for both pPB1850-2360 and pPB1750-2360.
TABLE 27

In Vivo Neutralization Of Toxin B

Treatment Group(a)      Number Animals Alive(b)      Number Animals Dead(b)

Preimmune               0                            5
CTB                     5                            0
INT 1+2                 0                            5
INT 4+5                 0                            5
pMB1750-2360            0                            5
pMB1970-2360            0                            5
pPB1750-2360            5                            0

(a) C. difficile toxin B (CTB) (at 5 µg/ml; 25 µg total; Tech Lab) at lethal concentration to
hamsters is added to antibody and incubated for one hour at 37°C. After incubation this
mixture is injected intraperitoneally (IP) into hamsters. Each treatment group received toxin
premixed with antibody raised against the indicated protein, as a 4X antibody PEG prep.

(b) The numbers in each group represent numbers of hamsters dead or alive, 2 hours post IP
administration of toxin/antibody mixture.



TABLE 28

In Vivo Neutralization Of Toxin B Using Affinity Purified Antibodies

Treatment Group(a)                 Number Animals Alive(b)      Number Animals Dead(b)

Preimmune (1)                      0                            5
CTB (1)                            5                            0
pPB1750-2360 (1)                   5                            0
1.5 mg anti-pMB1750-2360 (2)       1                            4
1.5 mg anti-pMB1970-2360 (2)       0                            5
300 µg anti-CTB (2)                5                            0

(a) C. difficile toxin B (CTB) (at 5 µg/ml; 25 µg total; Tech Lab) at lethal concentration to
hamsters is added to antibody and incubated for one hour at 37°C. After incubation, 1 ml of
this mixture is injected intraperitoneally (IP) into hamsters. Each treatment group received
toxin premixed with antibody raised against the indicated protein, as either (1) 4X antibody
PEG prep or (2) affinity purified antibody (on a pPB1750-2360 resin), either 1.5 mg/group
(anti-pMB1750-2360 and anti-pMB1970-2360; used undiluted affinity purified antibody) or 350
µg/group (anti-CTB, repeat specific; used 1/5 diluted anti-CTB antibody).

(b) The numbers in each group represent numbers of hamsters dead or alive, 2 hr post-IP
administration of toxin/antibody mixture.
TABLE 29

Generation Of Neutralizing Antibodies Utilizing The Gerbu Adjuvant

Treatment Group(a)            Number Animals Alive(b)      Number Animals Dead(b)

Preimmune                     0                            5
CTB                           5                            0
pMB1970-2360                  0                            5
pMB1850-2360                  0                            5
pPB1850-2360                  0                            5
pMB1750-2360 (Gerbu adj)      5                            0

(a) C. difficile toxin B (CTB) (Tech Lab) at lethal concentration to hamsters is added to antibody
and incubated for one hour at 37°C. After incubation this mixture is injected intraperitoneally
(IP) into hamsters. Each treatment group received toxin premixed with antibody raised against
the indicated protein, as a 4X antibody PEG prep.

(b) The numbers in each group represent numbers of hamsters dead or alive, 2 hrs post IP
administration of toxin/antibody mixture.
TABLE 30

In Vivo Neutralization Of Toxin B

Immunogen         Adjuvant    Tested            Antigen Utilized    In vivo
                              Preparation(a)    For AP              Neutralization(b)

Preimmune         NA(c)       PEG               NA                  no
CTB (native)      Titermax    PEG               NA                  yes
CTB (native)      Titermax    AP                pPB1750-2360        yes
CTB (native)      Titermax    AP                pPB1850-2360        yes
CTB (native)      Titermax    AP                pPB1750-1970        yes
CTB (native)      Titermax    AP                pPB1970-2360        yes
pMB1750-2360      Freunds     PEG               NA                  no
pMB1750-2360      Freunds     AP                pPB1750-2360        no
pMB1750-2360      Gerbu       PEG               NA                  yes
pMB1970-2360      Freunds     PEG               NA                  no
pMB1970-2360      Freunds     AP                pPB1750-2360        no
pPB1750-2360      Freunds     PEG               NA                  yes
pPB1850-2360      Freunds     PEG               NA                  no
pMB1850-2360      Freunds     PEG               NA                  no
INT 1+2           Freunds     PEG               NA                  no
INT 4+5           Freunds     PEG               NA                  no

(a) Either PEG preparation (PEG) or affinity purified antibodies (AP).

(b) 'Yes' denotes complete neutralization (0/5 dead) while 'no' denotes no neutralization (5/5 dead)
of toxin B, 2 hours post-administration of mixture.

(c) 'NA' denotes not applicable.
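
The yes/no convention used in Table 30 can be written as a simple rule applied to the survival counts. The sketch below is illustrative only: the counts are those of Table 27, the function name `neutralization_call` is hypothetical, and the "partial" label for intermediate outcomes is an assumption that the tables themselves do not define.

```python
def neutralization_call(dead, group_size=5):
    """Map the number of dead hamsters in a group to a neutralization call.

    Follows the Table 30 footnote: 0/5 dead is 'yes', 5/5 dead is 'no'.
    'partial' is an assumed label for anything in between.
    """
    if not 0 <= dead <= group_size:
        raise ValueError("dead must be between 0 and group_size")
    if dead == 0:
        return "yes"
    if dead == group_size:
        return "no"
    return "partial"


# (alive, dead) counts as listed in Table 27.
table_27 = {
    "Preimmune": (0, 5),
    "CTB": (5, 0),
    "INT 1+2": (0, 5),
    "INT 4+5": (0, 5),
    "pMB1750-2360": (0, 5),
    "pMB1970-2360": (0, 5),
    "pPB1750-2360": (5, 0),
}

calls = {
    group: neutralization_call(dead, alive + dead)
    for group, (alive, dead) in table_27.items()
}
for group, call in calls.items():
    print(f"{group}: {call}")
```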

The pPB1750-2360 antibody pool confers significant in vivo protection, equivalent to
that obtained with the affinity purified CTB antibodies. This correlates with the observed high
affinity of this antibody pool (relative to the pMB1750-2360 or pMB1970-2360 pools) as
assayed by Western blot analysis (Figure 24). These results provide the first demonstration
that in vivo neutralizing antibodies can be induced using recombinant toxin B protein as
immunogen.

The failure of high concentrations of antibodies raised against the pMB1750-2360
protein (using Freunds adjuvant) to neutralize, while the use of Gerbu adjuvant and
pMB1750-2360 protein generates a neutralizing response, demonstrates that conformation or
presentation of this protein is essential for the induction of neutralizing antibodies. These
results are consistent with the observation that the neutralizing antibodies produced when
native CTB is used as an immunogen appear to recognize conformational epitopes [see section
b) above]. This is the first demonstration that the conformation or presentation of
recombinant toxin B protein is essential to generate high titers of neutralizing antibodies.

EXAMPLE 20

Determination Of Quantitative And Qualitative
Differences Between pMB1750-2360, pMB1750-2360 (Gerbu)
Or pPB1750-2360 IgY Polyclonal Antibody Preparations

In Example 19, it was demonstrated that toxin B neutralizing antibodies could be
generated using specific recombinant toxin B proteins (pPB1750-2360) or specific adjuvants.
Antibodies raised against pMB1750-2360 were capable of neutralizing the enterotoxin effect
of toxin B when the recombinant protein was used to immunize hens in conjunction with the
Gerbu adjuvant, but not when Freunds adjuvant was used. To determine the basis for these
antigen and adjuvant restrictions, toxin B-specific antibodies present in the neutralizing and
non-neutralizing PEG preparations were isolated by affinity chromatography and tested for
qualitative or quantitative differences. The example involved a) purification of anti-toxin B
specific antibodies from pMB1750-2360 and pPB1750-2360 PEG preparations and b) in vivo
neutralization of toxin B using the affinity purified antibody.

a) Purification Of Specific Antibodies From pMB1750-2360 And
pPB1750-2360 PEG Preparations

To purify and determine the concentration of specific antibodies (expressed as the
percent of total antibody) within the pMB1750-2360 (Freunds and Gerbu) and pPB1750-2360
PEG preparations, defined quantities of these antibody preparations were chromatographed on
an affinity column containing the entire toxin B repeat region (pPB1750-2360). The amount
of affinity purified antibody was then quantified.

An affinity column containing the recombinant toxin B repeat protein, pPB1750-2360,
was made as follows. Four ml of PBS-washed Actigel resin (Sterogene) was coupled with 5
mg of pPB1750-2360 affinity purified protein (dialyzed into PBS; estimated to be greater than
95% full length fusion protein) in a 15 ml tube (Falcon) containing 1/10 final volume
Ald-coupling solution (1M sodium cyanoborohydride). Aliquots of the supernatant from the
coupling reactions, before and after coupling, were assessed by Coomassie staining of 7.5%
SDS-PAGE gels. Based on protein band intensities, greater than 95% (approximately 5 mg)
of recombinant protein was coupled to the resin. The coupled resin was poured into a 10 ml
column (BioRad), washed extensively with PBS, pre-eluted with 4M guanidine-HCl (in 10
mM Tris-HCl, pH 8.0; 0.005% thimerosal), re-equilibrated in PBS and stored at 4°C.

Aliquots of pMB1750-2360, pMB1750-2360 (Gerbu) or pPB1750-2360 IgY polyclonal
antibody preparations (PEG preps) were affinity purified on the above column as follows.
The column was attached to a UV monitor (ISCO), and washed with PBS. Forty ml aliquots
of 2X PEG preps (filter sterilized using a 0.45 µ filter and quantified by OD280 before
chromatography) were applied. The column was washed with PBS until the baseline was
re-established (the column flow-through was saved), washed with BBS-Tween to elute
nonspecifically binding antibodies and re-equilibrated with PBS. Bound antibody was eluted
from the column in 4M guanidine-HCl (in 10 mM Tris-HCl, pH 8.0, 0.005% thimerosal) and
the entire elution peak collected in a 15 ml tube (Falcon). The column was re-equilibrated,
and the column eluate re-chromatographed as described above. The antibody preparations
were quantified by UV absorbance (the elution buffer was used to zero the
spectrophotometer). Approximately 10 fold higher concentrations of total purified antibody
were obtained upon elution of the first chromatography pass relative to the second pass. The
low yield from the second chromatography pass indicated that most of the specific antibodies
were removed by the first round of chromatography.

Pools of affinity purified specific antibodies were prepared by dialysis of the column
elutes after the first column chromatography pass for the pMB1750-2360, pMB1750-2360
(Gerbu) or pPB1750-2360 IgY polyclonal antibody preparations. The elutes were collected
on ice and immediately dialyzed against a 100-fold volume of PBS at 4°C for 2 hrs. The
samples were then dialyzed against 3 changes of a 65-fold volume of PBS at 4°C. Dialysis
was performed for a minimum of 8 hrs per change of PBS. The dialyzed samples were
collected, centrifuged to remove insoluble debris, quantified by OD280, and stored at 4°C.

The percentage of toxin B repeat-specific antibodies present in each preparation was
determined using the quantifications of antibody yields from the first column pass (amount of
specific antibody recovered after first pass/total protein loaded). The yield of repeat-specific
affinity purified antibody (expressed as the percent of total protein in the preparation) in: 1)
the pMB1750-2360 PEG prep was approximately 0.5%, 2) the pMB1750-2360 (Gerbu) prep
was approximately 2.3%, and 3) the pPB1750-2360 prep was approximately 0.4%.
Purification of a CTB IgY polyclonal antibody preparation on the same column demonstrated
that the concentration of toxin B repeat specific antibodies in the CTB preparation was 0.35%.

These results demonstrate that 1) the use of Gerbu adjuvant enhanced the titer of
specific antibody produced against the pMB1750-2360 protein 5-fold relative to immunization
using Freunds adjuvant, and 2) the differences seen in the in vivo neutralization ability of the
pMB1750-2360 (not neutralizing) and pPB1750-2360 (neutralizing) and CTB (neutralizing)
PEG preps seen in Example 19 were not due to differences in the titers of repeat-specific
antibodies in the three preparations because the titer of repeat-specific antibody was similar
for all three preps; therefore the differing ability of the three antibody preparations to
neutralize toxin B must reflect qualitative differences in the induced toxin B repeat-specific
antibodies. To confirm that qualitative differences exist between antibodies raised in hens
immunized with different recombinant proteins and/or different adjuvants, the same amount of
affinity purified anti-toxin B repeat (aa 1870-2360 of toxin B) antibodies from the different
preparations was administered to hamsters using the in vivo hamster model as described
below.
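
The yield comparison above reduces to two small calculations: the percent of loaded protein recovered as specific antibody, and the ratio between two such yields. The sketch below is illustrative only. The percentages are those reported in the text, while the function name and the example milligram amounts passed to it are hypothetical and are not values from this example.

```python
def percent_specific(recovered_mg, loaded_mg):
    """Specific antibody recovered after the first pass, as % of protein loaded."""
    if loaded_mg <= 0:
        raise ValueError("loaded_mg must be positive")
    return 100.0 * recovered_mg / loaded_mg


# Yields of repeat-specific antibody reported in the text (% of total protein).
yields_percent = {
    "pMB1750-2360 (Freunds)": 0.5,
    "pMB1750-2360 (Gerbu)": 2.3,
    "pPB1750-2360": 0.4,
    "CTB": 0.35,
}

# Gerbu versus Freunds for the same pMB1750-2360 antigen.
gerbu_vs_freunds = (
    yields_percent["pMB1750-2360 (Gerbu)"]
    / yields_percent["pMB1750-2360 (Freunds)"]
)
print(f"Gerbu/Freunds yield ratio: {gerbu_vs_freunds:.1f}")

# Hypothetical amounts, for illustration of the formula only.
print(f"{percent_specific(2.0, 400.0):.2f}% specific antibody")
```

The ratio of the two reported yields is 4.6, which the text rounds to a 5-fold enhancement.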

b) In Vivo Neutralization Of Toxin B Using Affinity Purified
Antibody

The in vivo hamster model was utilized to assess the neutralizing ability of the affinity
purified antibodies raised against recombinant toxin B proteins purified in (a) above. As well,
a 4X IgY PEG preparation from a second independent immunization utilizing the pPB1750-2360
antigen with Freunds adjuvant was tested for in vivo neutralization. The results are
shown in Table 31.

The results shown in Table 31 demonstrate that:

1) As shown in Example 19 and reproduced here, 1.5 mg of affinity purified
antibody from pMB1750-2360 immunized hens using Freunds adjuvant does
not neutralize toxin B in vivo. However, 300 µg of affinity purified antibody
from similarly immunized hens utilizing Gerbu adjuvant demonstrated complete
neutralization of toxin B in vivo. This demonstrates that Gerbu adjuvant, in
addition to enhancing the titer of antibodies reactive to the pMB1750-2360
antigen relative to Freunds adjuvant (demonstrated in (a) above), also enhances
the yield of neutralizing antibodies to this antigen, greater than 5 fold.

2) Complete in vivo neutralization of toxin B was observed with 1.5 mg of
affinity purified antibody from hens immunized with pPB1750-2360 antigen,
but not with pMB1750-2360 antigen, when Freunds adjuvant was used. This
demonstrates, using standardized toxin B repeat-specific antibody
concentrations, that neutralizing antibodies were induced when pPB1750-2360
but not pMB1750-2360 was used as the antigen with Freunds adjuvant.

3) Complete in vivo neutralization was observed with 300 µg of pMB1750-2360
(Gerbu) antibody, but not with 300 µg of pPB1750-2360 (Freunds) antibody.
Thus the pMB1750-2360 (Gerbu) antibody has a higher titer of neutralizing
antibodies than the pPB1750-2360 (Freunds) antibody.

4) Complete neutralization of toxin B was observed using 300 µg of CTB
antibody [affinity purified (AP)] but not 100 µg CTB antibody (AP or PEG
prep). This demonstrates that greater than 100 µg of toxin B repeat-specific
antibody (anti-CTB) is necessary to neutralize 25 µg toxin B in vivo in this
assay, and that affinity purified antibodies specific to the toxin B repeat interval
neutralize toxin B as effectively as the PEG prep of IgY raised against the
entire CTB protein (shown in this assay).

5) As was observed with the initial pPB1750-2360 (IgY) PEG preparation
(Example 19), complete neutralization was observed with an IgY PEG
preparation isolated from a second independent group of pPB1750-2360
(Freunds) immunized hens. This demonstrates that neutralizing antibodies are
reproducibly produced when hens are immunized with pPB1750-2360 protein
utilizing Freunds adjuvant.
TABLE 31

In Vivo Neutralization Of Toxin B Using Affinity Purified Antibodies

Treatment Group(a)              Number Animals Alive(b)      Number Animals Dead(b)

Preimmune                       0                            5
CTB (300 µg)                    5                            0
CTB (100 µg)                    1                            4
pMB1750-2360 (G) (5 mg)         5                            0
pMB1750-2360 (1.5 mg)           5                            0
pMB1750-2360 (G) (300 µg)       5                            0
pMB1750-2360 (F) (1.5 mg)       0
pPB1750-2360 (F) (1.5 mg)       5                            0
pPB1750-2360 (300 µg)           1                            4
CTB (100 µg)
pPB1750-2360 (F) (500 µg)       5                            0



(a) C. difficile toxin B (CTB) (Tech Lab) at lethal concentration to hamsters (25 µg) was added to
the antibody (amount of specific antibody is indicated) and incubated for one hour at 37°C.
After incubation, this mixture was injected IP into hamsters (1/5 total mix injected per
hamster). Each treatment group received toxin premixed with antibody raised against the
indicated protein (G = Gerbu adjuvant, F = Freunds adjuvant). 1 indicates the antibody was a 4X
IgY PEG prep; 2 indicates the antibody was affinity purified on a pPB1850-2360 resin; and
3 indicates that the antibody was a 1X IgY PEG prep.

(b) The numbers in each group represent numbers of hamsters dead or alive, 2 hrs post IP
administration of toxin/antibody mixture.

EXAMPLE 21

Diagnostic Enzyme Immunoassays For C. difficile Toxins A And B

The ability of the recombinant toxin proteins and antibodies raised against these
recombinant proteins (described in the above examples) to form the basis of diagnostic assays
for the detection of clostridial toxin in a sample was examined. Two immunoassay formats
were tested to quantitatively detect C. difficile toxin A and toxin B from a biological
specimen. The first format involved a competitive assay in which a fixed amount of
recombinant toxin A or B was immobilized on a solid support (e.g., microtiter plate wells)
followed by the addition of a toxin-containing biological specimen mixed with affinity-purified
or PEG fractionated antibodies against recombinant toxin A or B. If toxin is present
in a specimen, this toxin will compete with the immobilized recombinant toxin protein for
binding to the anti-recombinant antibody thereby reducing the signal obtained following the
addition of a reporter reagent. The reporter reagent detects the presence of antibody bound to
the immobilized toxin protein.

In the second format, a sandwich immunoassay was developed using affinity-purified
antibodies to recombinant toxin A and B. The affinity-purified antibodies to recombinant
toxin A and B were used to coat microtiter wells instead of the recombinant polypeptides (as
was done in the competitive assay format). Biological samples containing toxin A or B were
then added to the wells followed by the addition of a reporter reagent to detect the presence
of bound toxin in the well.

a) Competitive Immunoassay For The Detection Of C. difficile
Toxin

Recombinant toxin A or B was attached to a solid support by coating 96 well
microtiter plates with the toxin protein at a concentration of 1 µg/ml in PBS. The plates were
incubated overnight at 2-8°C. The following morning, the coating solutions were removed
and the remaining protein binding sites on the wells were blocked by filling each well with a
PBS solution containing 0.5% BSA and 0.05% Tween-20. Native C. difficile toxin A or B
(Tech Lab) was diluted to 4 µg/ml in stool extracts from healthy Syrian hamsters (Sasco).
The stool extracts were made by placing fecal pellets in a 15 ml centrifuge tube; PBS was
added at 2 ml/pellet and the tube was vortexed to create a uniform suspension. The tube was
then centrifuged at 2000 rpm for 5 min at room temperature. The supernatant was removed;
this comprises the stool extract. Fifty µl of the hamster stool extract was pipetted into each
well of the microtiter plates to serve as the diluent for serial dilutions of the 4 µg/ml toxin
samples. One hundred µl of the toxin samples at 4 µg/ml was pipetted into the first row of
wells in the microtiter plate, and 50 µl aliquots were removed and diluted serially down the
plate in duplicate. An equal volume of affinity purified anti-recombinant toxin antibodies [1
ng/well of anti-pMA1870-2680 antibody was used for the detection of toxin A; 0.5 ng/well of
anti-pMB1750-2360(Gerbu) was used for the detection of toxin B] was added to appropriate
wells, and the plates were incubated at room temperature for 2 hours with gentle agitation.
Wells serving as negative control contained antibody but no native toxin to compete for
binding.
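
The toxin amounts per well that result from this dilution scheme can be worked out directly. The following is a sketch under stated assumptions: a 4 µg/ml toxin stock, 50 µl remaining in each well after the transfer, and two-fold steps down the plate. The function name and the choice of seven steps are illustrative and are not taken from the protocol.

```python
def toxin_ng_per_well(stock_ug_per_ml, well_volume_ul, n_steps):
    """Toxin mass (ng) in each well of a two-fold serial dilution.

    1 µg/ml equals 1 ng/µl, so concentration (µg/ml) times volume (µl)
    gives the mass in ng directly.
    """
    first_well_ng = stock_ug_per_ml * well_volume_ul
    return [first_well_ng / 2 ** step for step in range(n_steps)]


# Assumed from the text: 4 µg/ml stock, 50 µl left per well after transfer.
series = toxin_ng_per_well(4.0, 50.0, 7)
print(series)
```

Under these assumptions the first well holds 200 ng of toxin and each later well holds half the amount of the one before it.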

Unbound toxin and antibody were removed by washing the plates 3 to 5 times with
PBS containing 0.05% Tween-20. Following the wash step, 100 µl of rabbit anti-chicken IgG
antibody conjugated to alkaline phosphatase (Sigma) was added to each well and the plates
were incubated for 2 hours at room temperature. The plates were then washed as before to
remove unbound secondary antibody. Freshly prepared alkaline phosphatase substrate [1
mg/ml p-nitrophenyl phosphate (Sigma) in 50 mM Na2CO3, pH 9.5; 10 mM MgCl2] was
added to each well. Once sufficient color developed, the plates were read on a Dynatech
MR700 microtiter plate reader using a 410 nm filter.

The results are summarized in Tables 32 and 33. For the results shown in Table 32,
the wells were coated with recombinant toxin A protein (pMA1870-2680). The amount of
native toxin A added (present as an addition to solubilized hamster stool) to a given well is
indicated (0 to 200 ng). Antibody raised against the recombinant toxin A protein,
pMA1870-2680, was affinity purified on an affinity column containing pPA1870-2680 (described in
Example 20). As shown in Table 32, the recombinant toxin A protein and affinity-purified
antitoxin can be used for the basis of a competitive immunoassay for the detection of toxin A
in biological samples.

Similar results were obtained using the recombinant toxin B, pPB1750-2360, and
antibodies raised against pMB1750-2360(Gerbu). For the results shown in Table 33, the
wells were coated with recombinant toxin B protein (pPB1750-2360). The amount of native
toxin B added (present as an addition to solubilized hamster stool) to a given well is indicated
(0 to 200 ng). Antibody raised against the recombinant toxin B protein,
pMB1750-2360(Gerbu), was affinity purified on an affinity column containing pPB1850-2360
(described in Example 20). As shown in Table 33, the recombinant toxin B protein and
affinity-purified antitoxin can be used for the basis of a competitive immunoassay for the
detection of toxin B in biological samples.

In this competition assay, the reduction is considered significant over the background
levels at all points; therefore the assay can be used to detect samples containing less than 12.5
ng toxin A/well and as little as 50-100 ng toxin B/well.
TABLE 32

Competitive Inhibition Of Anti-C. difficile Toxin A By Native Toxin A

ng Toxin A/Well      OD410 Readout

200                  0.176
100                  0.253
50                   0.240
25                   0.259
12.5                 0.309
6.25                 0.367
3.125                0.417
0                    0.590



TABLE 33

Competitive Inhibition Of Anti-C. difficile Toxin B By Native Toxin B

ng Toxin B/Well      OD410 Readout

200                  0.392
100                  0.566
50                   0.607
25                   0.778
12.5                 0.970
6.25                 0.902
3.125                1.040
0                    1.055



These competitive inhibition assays demonstrate that native C. difficile toxins and
recombinant C. difficile toxin proteins can compete for binding to antibodies raised against
recombinant C. difficile toxins, demonstrating that these anti-recombinant toxin antibodies
provide effective diagnostic reagents.
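
One way to summarize the competitive assay readouts is as percent inhibition relative to the well without native toxin. The sketch below applies that calculation to the optical densities listed in Tables 32 and 33. Percent inhibition is an assumed summary metric chosen for illustration, it is not a calculation described in this example, and the function name is hypothetical.

```python
# OD410 readouts keyed by ng of native toxin per well (Tables 32 and 33).
toxin_a = {200: 0.176, 100: 0.253, 50: 0.240, 25: 0.259,
           12.5: 0.309, 6.25: 0.367, 3.125: 0.417, 0: 0.590}
toxin_b = {200: 0.392, 100: 0.566, 50: 0.607, 25: 0.778,
           12.5: 0.970, 6.25: 0.902, 3.125: 1.040, 0: 1.055}


def percent_inhibition(readouts):
    """Percent signal reduction at each toxin amount versus the 0 ng well."""
    uninhibited = readouts[0]
    return {
        ng: 100.0 * (1.0 - od / uninhibited)
        for ng, od in readouts.items()
        if ng != 0
    }


inhibition_a = percent_inhibition(toxin_a)
inhibition_b = percent_inhibition(toxin_b)
for ng in sorted(inhibition_a, reverse=True):
    print(f"{ng:>7} ng  A: {inhibition_a[ng]:5.1f}%  B: {inhibition_b[ng]:5.1f}%")
```

Expressed this way, the toxin A signal falls further at each toxin amount than the toxin B signal does, in line with the lower detection threshold reported for toxin A.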

b) Sandwich Immunoassay For The Detection Of C. difficile
Toxin

Affinity-purified antibodies against recombinant toxin A or toxin B were immobilized
to 96 well microtiter plates as follows. The wells were passively coated overnight at 4°C
with affinity purified antibodies raised against either pMA1870-2680 (toxin A) or
pMB1750-2360(Gerbu) (toxin B). The antibodies were affinity purified as described in Example 20.
The antibodies were used at a concentration of 1 µg/ml and 100 µl was added to each
microtiter well. The wells were then blocked with 200 µl of 0.5% BSA in PBS for 2 hours
at room temperature and the blocking solution was then decanted. Stool samples from healthy
Syrian hamsters were resuspended in PBS, pH 7.4 (2 ml PBS/stool pellet was used to
resuspend the pellets and the sample was centrifuged as described above). The stool
suspension was then spiked with native C. difficile toxin A or B (Tech Lab) at 4 µg/ml. The
stool suspensions containing toxin (either toxin A or toxin B) were then serially diluted
two-fold in stool suspension without toxin and 50 µl was added in duplicate to the coated
microtiter wells. Wells containing stool suspension without toxin served as the negative
control.

The plates were incubated for 2 hours at room temperature and then were washed
three times with PBS. One hundred µl of either goat anti-native toxin A or goat anti-native
toxin B (Tech Lab) diluted 1:1000 in PBS containing 1% BSA and 0.05% Tween 20 was
added to each well. The plates were incubated for another 2 hours at room temperature.
The plates were then washed as before and 100 µl of alkaline phosphatase-conjugated rabbit
anti-goat IgG (Cappel, Durham, N.C.) was added at a dilution of 1:1000. The plates were
incubated for another 2 hours at room temperature. The plates were washed as before then
developed by the addition of 100 µl/well of a substrate solution containing 1 mg/ml
p-nitrophenyl phosphate (Sigma) in 50 mM Na2CO3, pH 9.5; 10 mM MgCl2. The absorbance of
each well was measured using a plate reader (Dynatech) at 410 nm. The assay results are
shown in Tables 34 and 35.



TABLE 34

C. difficile Toxin A Detection In Stool Using Affinity-Purified Antibodies Against Toxin A

ng Toxin A/Well     OD410 Readout
200                 0.9
100                 0.8
50                  0.73
25                  0.71
12.5                0.59
6.25                0.421
0                   0

TABLE 35

C. difficile Toxin B Detection In Stool Using Affinity-Purified Antibodies Against Toxin B

ng Toxin B/Well     OD410 Readout
200                 1.2
100                 0.973
50                  0.887
25                  0.846
12.5                0.651
6.25                0.431
0                   0.004



The results shown in Tables 34 and 35 show that antibodies raised against recombinant
toxin A and toxin B fragments can be used to detect the presence of C. difficile toxin in stool
samples. These antibodies form the basis for a sensitive sandwich immunoassay which is
capable of detecting as little as 6.25 ng of either toxin A or B in a 50 µl stool sample. As
shown above in Tables 34 and 35, the background for this sandwich immunoassay is
extremely low; therefore, the detection limit of this assay is much lower than 6.25 ng
toxin/well. It is likely that toxin levels of 0.5 to 1.0 pg/well could be detected by this assay.

The results shown above in Tables 32-35 demonstrate clear utility of the recombinant
reagents in C. difficile toxin detection systems.

EXAMPLE 22 

Construction And Expression Of C. botulinum C Fragment Fusion Proteins



The C. botulinum type A neurotoxin gene has been cloned and sequenced [Thompson,
et al., Eur. J. Biochem. 189:73 (1990)]. The nucleotide sequence of the toxin gene is
available from the EMBL/GenBank sequence data banks under the accession number X52066;
the nucleotide sequence of the coding region is listed in SEQ ID NO:27. The amino acid
sequence of the C. botulinum type A neurotoxin is listed in SEQ ID NO:28. The type A
neurotoxin gene is synthesized as a single polypeptide chain which is processed to form a
dimer composed of a light and a heavy chain linked via disulfide bonds. The 50 kD
carboxy-terminal portion of the heavy chain is referred to as the C fragment or the HC domain.

Previous attempts by others to express polypeptides comprising the C fragment of C.
botulinum type A toxin as a native polypeptide (e.g., not as a fusion protein) in E. coli have
been unsuccessful [H.F. LaPenotiere, et al. in Botulinum and Tetanus Neurotoxins, DasGupta,
Ed., Plenum Press, New York (1993), pp. 463-466]. Expression of the C fragment as a
fusion with the E. coli MBP was reported to result in the production of insoluble protein
(H.F. LaPenotiere, et al., supra).

In order to produce soluble recombinant C fragment proteins in E. coli, fusion proteins
comprising a synthetic C fragment gene derived from the C. botulinum type A toxin and
either a portion of the C. difficile toxin protein or the MBP were constructed. This example
involved a) the construction of plasmids encoding C fragment fusion proteins and b)
expression of C. botulinum C fragment fusion proteins in E. coli.

a) Construction Of Plasmids Encoding C Fragment Fusion 
Proteins 

In Example 11, it was demonstrated that the C. difficile toxin A repeat domain can be
efficiently expressed and purified in E. coli as either native (expressed in the pET 23a vector
in clone pPA1870-2680) or fusion (expressed in the pMAL-c vector as a fusion with the E.
coli MBP in clone pMA1870-2680) proteins. Fusion proteins comprising a fusion between
the MBP, portions of the C. difficile toxin A repeat domain (shown to be expressed as a
soluble fusion protein) and the C fragment of the C. botulinum type A toxin were constructed.
A fusion protein comprising the C fragment of the C. botulinum type A toxin and the MBP
was also constructed.

Figure 25 provides a schematic representation of the botulinal fusion proteins along
with the donor constructs containing the C. difficile toxin A sequences or C. botulinum C
fragment sequences which were used to generate the botulinal fusion proteins. In Figure 25,
the solid boxes represent C. difficile toxin A gene sequences, the open boxes represent C.
botulinum C fragment sequences and the solid black ovals represent the E. coli MBP. When
the name for a restriction enzyme appears inside parentheses, this indicates that the restriction
site was destroyed during construction. An asterisk appearing with the name for a restriction
enzyme indicates that this restriction site was recreated at the cloning junction.

In Figure 25, a restriction map of the pMA1870-2680 and pPA1100-2680 constructs
(described in Example 11) which contain sequences derived from the C. difficile toxin A
repeat domain are shown; these constructs were used as the source of C. difficile toxin A gene
sequences for the construction of plasmids encoding fusions between the C. botulinum C
fragment gene and the C. difficile toxin A gene. The pMA1870-2680 expression construct
expresses high levels of soluble, intact fusion protein (20 mg/liter culture) which can be
affinity purified on an amylose column (purification described in Example 11d).

The pAlterBot construct (Figure 25) was used as the source of C. botulinum C
fragment gene sequences for the botulinal fusion proteins. pAlterBot was obtained from J.
Middlebrook and R. Lemley at the U.S. Department of Defense. pAlterBot contains a
synthetic C. botulinum C fragment inserted into the pALTER-1® vector (Promega). This
synthetic C fragment gene encodes the same amino acids as does the naturally occurring C
fragment gene. The naturally occurring C fragment sequences, like most clostridial genes, are
extremely A/T rich (Thompson et al., supra). This high A/T content creates expression
difficulties in E. coli and yeast due to altered codon usage frequency and fortuitous
polyadenylation sites, respectively. In order to improve the expression of C fragment proteins
in E. coli, a synthetic version of the gene was created in which the non-preferred codons were
replaced with preferred codons.
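
The codon replacement strategy can be illustrated with a minimal Python sketch. The four
codon substitutions and the 18-nucleotide test sequence below are hypothetical and were
chosen only for illustration; they are not the substitutions used to build the synthetic
gene in pAlterBot, and the test sequence is not taken from any SEQ ID NO. The sketch shows
that synonymous replacement can lower the A/T fraction while leaving the encoded amino
acids unchanged.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical synonymous substitutions (A/T-rich codon -> alternative codon).
SUBSTITUTIONS = {"AGA": "CGT", "ATA": "ATC", "TTA": "CTG", "GGA": "GGT"}


def codons(seq):
    """Split a coding sequence into consecutive triplets."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq):
    """Translate a coding sequence into one-letter amino acid codes."""
    return "".join(CODON_TABLE[c] for c in codons(seq))


def recode(seq):
    """Replace each listed codon with its synonymous alternative."""
    return "".join(SUBSTITUTIONS.get(c, c) for c in codons(seq))


def at_fraction(seq):
    """Fraction of positions that are A or T."""
    return sum(base in "AT" for base in seq) / len(seq)


native = "ATGGCTAGAATATTAGGA"  # made-up A/T-rich example sequence
synthetic = recode(native)
print(translate(native), translate(synthetic))
print(round(at_fraction(native), 2), round(at_fraction(synthetic), 2))
```

In this toy case both sequences translate to the same six residues, while the A/T fraction
falls from about 0.67 to about 0.44.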

The nucleotide sequence of the C. botulinum C fragment gene sequences contained
within pAlterBot is listed in SEQ ID NO:22. The first six nucleotides (ATGGCT) encode a
methionine and alanine residue, respectively. These two amino acids result from the insertion
of the C. botulinum C fragment sequences into the pALTER-1® vector and provide the initiator
methionine residue. The amino acid sequence of the C. botulinum C fragment encoded by the
sequences contained within pAlterBot is listed in SEQ ID NO:23. The first two amino acids
(Met Ala) are encoded by vector-derived sequences. From the third amino acid residue
onward (Arg), the amino acid sequence is identical to that found in the C. botulinum type A
toxin gene.

The pMA1870-2680, pPA1100-2680 and pAlterBot constructs were used as progenitor
plasmids to make expression constructs in which fragments of the C. difficile toxin A repeat
domain were expressed as genetic fusions with the C. botulinum C fragment gene using the
pMAL-c expression vector (New England BioLabs). The pMAL-c expression vector
generates fusion proteins which contain the MBP at the amino-terminal end of the protein. A
construct, pMBot, in which the C. botulinum C fragment gene was expressed as a fusion with
only the MBP was constructed (Figure 25). Fusion protein expression was induced from E.
coli strains harboring the above plasmids, and induced protein was affinity purified on an
amylose resin column.

i) Construction Of pBlueBot

In order to facilitate the cloning of the C. botulinum C fragment gene sequences into a
number of desired constructs, the botulinal gene sequences were removed from pAlterBot and
were inserted into the pBluescript plasmid (Stratagene) to generate pBlueBot (Figure 25).
pBlueBot was constructed as follows. Bacteria containing the pAlterBot plasmid were grown
in medium containing tetracycline and plasmid DNA was isolated using the QIAprep-spin
Plasmid Kit (Qiagen). One microgram of pAlterBot DNA was digested with NcoI and the
resulting 3' recessed sticky end was made blunt using the Klenow fragment of DNA
polymerase I (hereafter the Klenow fragment). The pAlterBot DNA was then digested with
HindIII to release the botulinal gene sequences (the Bot insert) as a blunt (filled NcoI
site)-HindIII fragment. pBluescript vector DNA was prepared by digesting 200 ng of pBluescript
DNA with SmaI and HindIII. The digestion products from both plasmids were resolved on an
agarose gel. The appropriate fragments were removed from the gel, mixed and purified
utilizing the Prep-a-Gene kit (BioRad). The eluted DNA was then ligated using T4 DNA
ligase and used to transform competent DH5α cells (Gibco-BRL). Host cells were made
competent for transformation using the calcium chloride protocol of Sambrook et al., supra at
1.82-1.83. Recombinant clones were isolated and confirmed by restriction digestion using
standard recombinant molecular biology techniques (Sambrook et al., supra). The resultant
clone, pBlueBot, contains several useful unique restriction sites flanking the Bot insert (i.e.,
the C. botulinum C fragment sequences derived from pAlterBot) as shown in Figure 25.

ii) Construction Of C. difficile / C. botulinum /
MBP Fusion Proteins

Constructs encoding fusions between the C. difficile toxin A gene and the C. botulinum
C fragment gene and the MBP were made utilizing the same recombinant DNA methodology
outlined above; these fusion proteins contained varying amounts of the C. difficile toxin A
repeat domain.

The pMABot clone contains a 2.4 kb insert derived from the C. difficile toxin A gene
fused to the Bot insert (i.e., the C. botulinum C fragment sequences derived from pAlterBot).
pMABot (Figure 25) was constructed by mixing gel-purified DNA from NotI/HindIII digested
pBlueBot (the 1.2 kb Bot fragment), SpeI/NotI digested pPA1100-2680 (the 2.4 kb C. difficile
toxin A repeat fragment) and XbaI/HindIII digested pMAL-c vector. Recombinant clones
were isolated, confirmed by restriction digestion and purified using the QIAprep-spin Plasmid
Kit (Qiagen). This clone expresses the toxin A repeats and the botulinal C fragment protein
sequences as an in-frame fusion with the MBP.

The pMCABot construct contains a 1.0 kb insert derived from the C. difficile toxin A
gene fused to the Bot insert (i.e., the C. botulinum C fragment sequences derived from
pAlterBot). pMCABot was constructed by digesting the pMABot clone with EcoRI to
remove the 5' end of the C. difficile toxin A repeat (see Figure 25; the pMAL-c vector
contains an EcoRI site 5' to the C. difficile insert in the pMABot clone). The restriction sites
were filled and religated together after gel purification. The resultant clone (pMCABot,
Figure 25) generated an in-frame fusion between the MBP and the remaining 3' portion of the
C. difficile toxin A repeat domain fused to the Bot gene.

The pMNABot clone contains the 1 kb SpeI/EcoRI (filled) fragment from the C.
difficile toxin A repeat domain (derived from clone pPA1100-2680) and the 1.2 kb C.
botulinum C fragment gene as an NcoI (filled)/HindIII fragment (derived from pAlterBot).
These two fragments were inserted into the pMAL-c vector digested with XbaI/HindIII. The
two insert fragments were generated by digestion of the appropriate plasmid with EcoRI
(pPA1100-2680) or NcoI (pAlterBot) followed by treatment with the Klenow fragment. After
treatment with the Klenow fragment, the plasmids were digested with the second enzyme
(either SpeI or HindIII). All three fragments were gel purified, mixed and Prep-a-Gene
purified prior to ligation. Following ligation and transformation, putative recombinants were
analyzed by restriction analysis; the EcoRI site was found to be regenerated at the fusion
junction, as was predicted for a fusion between the filled EcoRI and NcoI sites.
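
The prediction that an EcoRI site is regenerated at this junction can be checked with a short
Python sketch. It assumes the standard recognition sequences G^AATTC (EcoRI) and C^CATGG
(NcoI), both of which leave 5' overhangs that the Klenow fragment fills in; the helper
function names are illustrative and do not come from any library.

```python
def filled_upstream_end(site, cut):
    """Top-strand bases retained at the end of the upstream fragment after
    Klenow fill-in of a 5' overhang at a palindromic site cut after `cut` bases."""
    # Fill-in extends the recessed top strand across the overhang.
    return site[: len(site) - cut]


def filled_downstream_end(site, cut):
    """Top-strand bases retained at the start of the downstream fragment
    after fill-in (the top strand already carries the overhang)."""
    return site[cut:]


ECORI = ("GAATTC", 1)  # G^AATTC
NCOI = ("CCATGG", 1)   # C^CATGG

# Toxin A fragment ends at the filled EcoRI site; Bot insert starts at the filled NcoI site.
junction = filled_upstream_end(*ECORI) + filled_downstream_end(*NCOI)
print(junction)
print("EcoRI site present:", ECORI[0] in junction)
print("NcoI site present:", NCOI[0] in junction)
```

Under these assumptions the blunt-end junction reads GAATTCATGG, which contains an intact
EcoRI site but no NcoI site.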

A construct encoding a fusion protein between the botulinal C fragment gene and the
MBP gene was constructed (i.e., this fusion lacks any C. difficile toxin A gene sequences) and
termed pMBot. The pMBot construct was made by removal of the C. difficile toxin A
sequences from the pMABot construct and fusing the C fragment gene sequences to the MBP.
This was accomplished by digestion of pMABot DNA with a restriction enzyme whose site is
located in the pMALc polylinker 5' to the XbaI site and with XbaI (located 3' to the NotI site
at the toxA-Bot fusion junction), filling in the XbaI site using the Klenow fragment, gel
purifying the desired restriction fragment, and ligating the blunt ends to circularize the
plasmid. Following ligation and transformation, putative recombinants were analyzed by
restriction mapping of the Bot insert (i.e., the C. botulinum C fragment sequences).

b) Expression Of C. botulinum C Fragment Fusion Proteins In
E. coli

Large scale (1 liter) cultures of the pMAL-c vector, and each recombinant construct
described above in (a) were grown, induced, and soluble protein fractions were isolated as
described in Example 18. The soluble protein extracts were chromatographed on amylose
affinity columns to isolate recombinant fusion protein. The purified recombinant fusion
proteins were analyzed by running samples on SDS-PAGE gels followed by Coomassie
staining and by Western blot analysis as described [Williams et al. (1994) supra]. In brief,
extracts were prepared and chromatographed in column buffer (10 mM NaPO4, 0.5 M NaCl,
10 mM β-mercaptoethanol, pH 7.2) over an amylose resin (New England Biolabs) column,
and eluted with column buffer containing 10 mM maltose as described [Williams, et al.
(1994), supra]. An SDS-PAGE gel containing the purified protein samples stained with
Coomassie blue is shown in Figure 26.

In Figure 26, the following samples were loaded. Lanes 1-6 contain protein purified
from E. coli containing the pMAL-c, pPA1870-2680, pMABot, pMNABot, pMCABot and
pMBot plasmids, respectively. Lane 7 contains broad range molecular weight protein markers
(BioRad).

The protein samples were prepared for electrophoresis by mixing 5 µl of eluted protein
with 5 µl of 2X SDS-PAGE sample buffer (0.125 mM Tris-HCl, pH 6.8, 2 mM EDTA, 6%
SDS, 20% glycerol, 0.025% bromophenol blue; β-mercaptoethanol is added to 5% before
use). The samples were heated to 95°C for 5 min, then cooled and loaded on a 7.5% agarose
SDS-PAGE gel. Broad range molecular weight protein markers were also loaded to allow
estimation of the MW of identified fusion proteins. After electrophoresis, protein was
detected generally by staining the gel with Coomassie blue.

In all cases the yields were in excess of 20 mg fusion protein per liter culture (see
Table 36) and, with the exception of the pMCABot protein, a high percentage (i.e., greater
than 20-50% of total eluted protein) of the eluted fusion protein was of a MW predicted for
the full length fusion protein (Figure 26). It was estimated (by visual inspection) that less
than 10% of the pMCABot fusion protein was expressed as the full length fusion protein.

TABLE 36

Yield Of Affinity Purified C. botulinum C Fragment / MBP Fusion Proteins

Construct        Yield (mg/liter of Culture)     Percentage Of Total Soluble Protein
pMABot           24                              5.0
pMCABot          34                              5.0
pMNABot          40                              5.5
pMBot            22                              5.0
pMA1870-2680     40                              4.8



These results demonstrate that high level expression of intact C. botulinum C
fragment/C. difficile toxin A fusion proteins in E. coli is feasible using the pMAL-c
expression system. These results are in contrast to those reported by H.F. LaPenotiere, et al.
(1993), supra. In addition, these results show that it is not necessary to fuse the botulinal C
fragment gene to the C. difficile toxin A gene in order to produce a soluble fusion protein
using the pMAL-c system in E. coli.

In order to determine whether the above-described botulinal fusion proteins were
recognized by anti-C. botulinum toxin A antibodies, Western blots were performed. Samples
containing affinity-purified proteins from E. coli containing the pMABot, pMCABot,
pMNABot, pMBot, pMA1870-2680 or pMALc plasmids were analyzed. SDS-PAGE gels
(7.5% acrylamide) were loaded with protein samples purified from each expression construct.
After electrophoresis, the gels were blotted and protein transfer was confirmed by Ponceau S
staining (as described in Example 12b).

Following protein transfer, the blots were blocked by incubation for 1 hr at 20°C in
blocking buffer [PBST (PBS containing 0.1% Tween 20 and 5% dry milk)]. The blots were
then incubated in 10 ml of a solution containing the primary antibody; this solution comprised
a 1/500 dilution of an anti-C. botulinum toxin A IgY PEG prep (described in Example 3) in
blocking buffer. The blots were incubated for 1 hr at room temperature in the presence of the
primary antibody. The blots were washed and developed using a rabbit anti-chicken alkaline
phosphatase conjugate (Boehringer Mannheim) as the secondary antibody as follows. The
rabbit anti-chicken antibody was diluted to 1 µg/ml in blocking buffer (10 ml final volume
per blot) and the blots were incubated at room temperature for 1 hour in the presence of the
secondary antibody. The blots were then washed successively with PBST, BBS-Tween and
50 mM Na2CO3, pH 9.5. The blots were then developed in freshly-prepared alkaline
phosphatase substrate buffer (100 µg/ml nitro blue tetrazolium, 50 µg/ml 5-bromo-chloro-
indolylphosphate, 5 mM MgCl2 in 50 mM Na2CO3, pH 9.5). Development was stopped by
flooding the blots with distilled water and the blots were air dried.

This Western blot analysis detected anti-C. botulinum toxin reactive proteins in the
pMABot, pMCABot, pMNABot and pMBot protein samples (corresponding to the predicted
full length proteins identified above by Coomassie staining in Figure 26), but not in the
pMA1100-2680 or pMALc protein samples.

These results demonstrate that the relevant fusion proteins purified on an amylose resin
as described above in section a) contained immunoreactive C. botulinum C fragment protein as
predicted.

EXAMPLE 23 

Generation Of Neutralizing Antibodies
By Nasal Administration Of pMBot Protein

The ability of the recombinant botulinal toxin proteins produced in Example 22 to
stimulate a systemic immune response against botulinal toxin epitopes was assessed. This
example involved: a) the evaluation of the induction of serum IgG titers produced by nasal or
oral administration of botulinal toxin-containing C. difficile toxin A fusion proteins and b)
the in vivo neutralization of C. botulinum type A neurotoxin by anti-recombinant C.
botulinum C fragment antibodies.

a) Evaluation Of The Induction Of Serum Titers Produced
By Nasal Or Oral Administration Of Botulinal Toxin-
Containing C. difficile Toxin A Fusion Proteins

Six groups containing five 6 week old CF female rats (Charles River) per group were
immunized nasally or orally with one of the following three combinations using protein
prepared in Example 22: 1) 250 µg pMBot protein per rat (nasal and oral); 2) 250 µg
pMABot protein per rat (nasal and oral); 3) 125 µg pMBot admixed with 125 µg
pMA1870-2680 per rat (nasal and oral). A second set of 5 groups containing 3 CF female
rats/group were immunized nasally or orally with one of the following combinations: 4) 250 µg
pMNABot protein per rat (nasal and oral) or 5) 250 µg pMAL-c protein per rat (nasal and
oral).

The fusion proteins were prepared for immunization as follows. The proteins (in
column buffer containing 10 mM maltose) were diluted in 0.1 M carbonate buffer, pH 9.5 and
administered orally or nasally in a 200 µl volume. The rats were lightly sedated with ether
prior to administration. The oral dosing was accomplished using a 20 gauge feeding needle.
The nasal dosing was performed using a P-200 micro-pipettor (Gilson). The rats were
boosted 14 days after the primary immunization using the techniques described above and
were bled 7 days later. Rats from each group were lightly etherized and bled from the tail.
The blood was allowed to clot at 37°C for 1 hr and the serum was collected.

The serum from individual rats was analyzed using an ELISA to determine the anti-C.
botulinum type A toxin IgG serum titer. The ELISA protocol used is a modification of that
described in Example 13c. Briefly, 96-well microtiter plates (Falcon, Pro-Bind Assay Plates)
were coated with C. botulinum type A toxoid (prepared as described in Example 3a) by
placing 100 µl volumes of C. botulinum type A toxoid at 2.5 µg/ml in PBS containing
0.005% thimerosal in each well and incubating overnight at 4°C. The next morning, the
coating suspensions were decanted and all wells were washed three times using PBS.

In order to block non-specific binding sites, 100 µl of blocking solution (0.5% BSA in
PBS) was then added to each well and the plates were incubated for 1 hr at 37°C. The
blocking solution was decanted and duplicate samples of 150 µl of diluted rat serum added to
the first well of a dilution series. The initial testing serum dilution was 1:30 in blocking
solution containing 0.5% Tween 20 followed by 5-fold dilutions into this solution. This was
accomplished by serially transferring 30 µl aliquots to 120 µl blocking solution containing
0.5% Tween 20, mixing, and repeating the dilution into a fresh well. After the final dilution,
30 µl was removed from the well such that all wells contained 120 µl final volume. A total
of 3 such dilutions were performed (4 wells total). The plates were incubated 1 hr at 37°C.
Following this incubation, the serially diluted samples were decanted and the wells were
washed six times using PBS containing 0.5% Tween 20 (PBST). To each well, 100 µl of a
rabbit anti-rat IgG alkaline phosphatase (Sigma) diluted (1/1000) in blocking buffer
containing 0.5% Tween 20 was added and the plate was incubated for 1 hr at 37°C. The
conjugate solutions were decanted and the plates were washed as described above, substituting
50 mM Na2CO3, pH 9.5 for the PBST in the final wash. The plates were developed by the
addition of 100 µl of a solution containing 1 mg/ml para-nitro phenyl phosphate (Sigma)
dissolved in 50 mM Na2CO3, 10 mM MgCl2, pH 9.5 to each well, and incubating the plates at
room temperature in the dark for 5-45 min. The absorbency of each well was measured at
410 nm using a Dynatech MR 700 plate reader. The results are summarized in Tables 37 and
38 and represent mean serum reactivities of individual rats.
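
The dilution scheme described above (30 µl transferred into 120 µl of diluent, starting from
a 1:30 dilution) can be worked through with a short Python sketch. The function name and
parameter names are illustrative assumptions rather than part of the protocol.

```python
from fractions import Fraction


def dilution_series(initial_dilution, transfer_ul, diluent_ul, n_transfers):
    """Return the reciprocal serum dilution in each well of a serial dilution."""
    # Each transfer dilutes by (transfer + diluent) / transfer.
    step = Fraction(transfer_ul + diluent_ul, transfer_ul)
    return [initial_dilution * step ** i for i in range(n_transfers + 1)]


# 1:30 starting dilution, 30 ul into 120 ul, three transfers (four wells in total).
series = dilution_series(initial_dilution=30, transfer_ul=30, diluent_ul=120, n_transfers=3)
print([f"1:{d}" for d in series])
```

With these inputs each step is a 5-fold dilution and the four wells correspond to 1:30,
1:150, 1:750 and 1:3750, the dilutions reported in Tables 37 and 38.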

TABLE 37

Determination Of Anti-C. botulinum Type A Toxin Serum IgG Titers Following Immunization
With C. botulinum C Fragment-Containing Fusion Proteins

Route of Immunization:                  Nasal                                          Oral
Immunogen:     PRE-IMMUNE   pMBot          pMBot &        pMABot         pMBot          pMBot &        pMABot
                                           pMA1870-2680                                 pMA1870-2680
Dilution
1:30           0.080        [illegible]    [illegible]    0.060          0.100          0.080          0.120
1:150          0.017        0.580          [illegible]    0.032          0.070          0.020          0.027
1:750          [illegible]  [illegible]    [illegible]    0.010          0.020          0.010          [illegible]
1:3750         [illegible]  0.084          [illegible]    [illegible]    [illegible]    0.010          0.007
# Rats Tested  [illegible]

Numbers represent the average values obtained from two ELISA plates, standardized utilizing
the preimmune control.



TABLE 38

Determination Of Anti-C. botulinum Type A Toxin Serum IgG Titers
Following Immunization With C. botulinum C Fragment-Containing Fusion Proteins

Route of Immunization:                  Nasal                          Oral
Immunogen:     PRE-IMMUNE   pMBot          pMABot         pMNABot        pMNABot
Dilution
1:30           0.040        0.557          0.010          0.015          0.010
1:150          0.000        0.383          0.001          0.003          0.002
1:750          0.001        0.140          0.000          0.000          0.000
1:3750         0.000        [illegible]    0.000          0.000          0.000
# Rats Tested               1              1              3





The above ELISA results demonstrate that reactivity against the botulinal fusion
proteins was strongest when the route of administration was nasal; only weak responses were
stimulated when the botulinal fusion proteins were given orally. Nasally delivered pMBot and
pMBot admixed with pMA1870-2680 invoked the greatest serum IgG response. These results
show that only the pMBot protein is necessary to induce this response, since the addition of
the pMA1870-2680 protein did not enhance antibody response (Table 37). Placement of the
C. difficile toxin A fragment between the MBP and the C. botulinum C fragment protein
dramatically reduced anti-bot IgG titer (see results using pMABot, pMCABot and pMNABot
proteins).

This study demonstrates that the pMBot protein induces a strong serum IgG response
directed against C. botulinum type A toxin when nasally administered.

b) In Vivo Neutralization Of C. botulinum Type A Neurotoxin
By Anti-Recombinant C. botulinum C Fragment Antibodies

The ability of the anti-C. botulinum type A toxin antibodies generated by nasal
administration of recombinant botulinal fusion proteins in rats (Example 22) to neutralize C.
botulinum type A toxin was tested in a mouse neutralization model. The mouse model is the
art-accepted method for detection of botulinal toxins in body fluids and for the evaluation of
anti-botulinal antibodies [E.J. Schantz and D.A. Kautter, J. Assoc. Off. Anal. Chem. 61:96
(1978) and Investigational New Drug (BB-IND-3703) application by the Surgeon General of
the Department of the Army to the Federal Food and Drug Administration]. The anti-C.
botulinum type A toxin antibodies were prepared as follows.

Rats from the group given pMBot protein by nasal administration were boosted a
second time with 250 µg pMBot protein per rat and serum was collected 7 days later. Serum
from one rat from this group and from a preimmune rat was tested for anti-C. botulinum type
A toxin neutralizing activity in the mouse neutralization model described below.

The LD50 of a solution of purified C. botulinum type A toxin complex, obtained from
Dr. Eric Johnson (University of Wisconsin Madison), was determined using the intraperitoneal
(IP) method of Schantz and Kautter [J. Assoc. Off. Anal. Chem. 61:96 (1978)] using 18-22
gram female ICR mice and was found to be 3500 LD50/ml. The determination of the LD50 was
performed as follows. A Type A toxin standard was prepared by dissolving purified type A
toxin complex in 25 mM sodium phosphate buffer, pH 6.8 to yield a stock toxin solution of
3.15 x 10^7 LD50/mg. The OD278 of the solution was determined and the concentration was
adjusted to 10-20 µg/ml. The toxin solution was then diluted 1:100 in gel-phosphate (30 mM
phosphate, pH 6.4; 0.2% gelatin). Further dilutions of the toxin solution were made as shown
below in Table 39. Two mice were injected IP with 0.5 ml of each dilution shown and the
mice were observed for symptoms of botulism for a period of 72 hours.

TABLE 39

Determination Of The LD50 Of Purified C. botulinum Type A Toxin Complex

Dilution     Number Dead At 72 hr
1:320        2/2
1:640        2/2
1:1280       2/2
1:2560       0/2 (sick after 72 hr)
1:5120       0/2 (no symptoms)

From the results shown in Table 39, the toxin titer was assumed to be between 2560
LD50/ml and 5120 LD50/ml (or about 3840 LD50/ml). This value was rounded to 3500
LD50/ml for the sake of calculation.
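
The bracketing used to arrive at the estimate of about 3840 LD50/ml can be sketched in
Python as follows. This is only one reading of the reasoning in the text: it assumes the
titer lies between the highest dilution that still produced symptoms and the first
dilution that produced none, and it takes the arithmetic midpoint of that interval. It is
not a formal LD50 estimator, and the function name is hypothetical.

```python
# Outcomes at 72 hr from Table 39, ordered from least to most dilute.
observations = [
    (320, "dead"),
    (640, "dead"),
    (1280, "dead"),
    (2560, "sick"),
    (5120, "none"),
]


def titer_bracket(results):
    """Return (last dilution with any symptoms, first symptom-free dilution)."""
    for (lower, lower_outcome), (upper, upper_outcome) in zip(results, results[1:]):
        if lower_outcome != "none" and upper_outcome == "none":
            return lower, upper
    raise ValueError("no transition from symptomatic to symptom-free dilutions")


low, high = titer_bracket(observations)
midpoint = (low + high) / 2
print(low, high, midpoint)
```

For the data in Table 39 this gives the interval 2560 to 5120 and a midpoint of 3840, which
the text then rounds to 3500 LD50/ml.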

The amount of neutralizing antibodies present in the serum of rats immunized nasally
with pMBot protein was then determined. Serum from two rats boosted with pMBot protein
as described above and preimmune serum from one rat was tested as follows. The toxin
standard was diluted 1:100 in gel-phosphate to a final concentration of 350 LD50/ml. One
milliliter of the diluted toxin standard was mixed with 25 µl of serum from each of the three
rats and 0.2 ml of gel-phosphate. The mixtures were incubated at room temperature for 30
min with occasional mixing. Each of two mice was injected IP with 0.5 ml of the
mixtures. The mice were observed for signs of botulism for 72 hr. Serum from rats
immunized with pMBot protein neutralized this challenge dose in the mice receiving it. Mice
receiving preimmune rat serum died in less than 24 hr.

The amount of neutralizing anti-toxin antibodies present in the serum of rats
immunized with pMBot protein was then quantitated. Serum antibody titrations were
performed by mixing 0.1 ml of each of the antibody dilutions (see Table 40) with 0.1 ml of a
1:10 dilution of stock toxin solution (3.5 x 10^4 LD50/ml) with 1.0 ml of gel-phosphate and
injecting 0.5 ml IP into 2 mice per dilution. The mice were then observed for signs of
botulism for 3 days (72 hr). The results are tabulated in Table 40.

As shown in Table 40, pMBot serum neutralized C. botulinum type A toxin complex
when used at a dilution of 1:320 or less. A mean neutralizing value of 168 IU/ml was
obtained for the pMBot serum (an IU is defined as 10,000 mouse LD50). This value translates
to a circulating serum titer of about 3.7 IU/mg of serum protein. This neutralizing titer is
comparable to the commercially available bottled concentrated (Connaught Laboratories, Ltd.)
horse anti-C. botulinum antiserum. A 10 ml vial of Connaught antiserum contains about 200
mg/ml of protein; each ml can neutralize 750 IU of C. botulinum type A toxin. After
administration of one vial to a human, the circulating serum titer of the Connaught
preparation would be approximately 25 IU/ml (assuming an average serum volume of 3 liters).
Thus, the circulating anti-C. botulinum titer seen in rats nasally immunized with pMBot
protein (168 IU/ml) is 6.7 times higher than the necessary circulating titer of anti-C. botulinum
antibody needed to be protective in humans.

TABLE 40

Quantitation Of Neutralizing Antibodies In pMBot Sera

                    pMBot*
Dilution     Rat 1          Rat 2
1:20         2/2            2/2
1:40         2/2            2/2
1:80         2/2            [illegible]
1:160        2/2            [illegible]
1:320        [illegible]    [illegible]
1:640        0/2            0/2
1:1280       0/2            0/2
1:2560       0/2            0/2

* Numbers represent the number of mice surviving at 72 hours which received serum taken from
rats immunized with the pMBot protein.

** These mice survived but were sick after 72 hr.

These results demonstrate that antibodies capable of neutralizing C. botulinum type A
toxin are induced when recombinant C. botulinum C fragment fusion protein produced in E.
coli is used as an immunogen.

EXAMPLE 24 

Production Of Soluble C. botulinum C Fragment
Protein Substantially Free Of Endotoxin Contamination

Example 23 demonstrated that neutralizing antibodies are generated by immunization
with the pMBot protein expressed in E. coli. These results showed that the pMBot fusion
protein is a good vaccine candidate. However, immunogens suitable for use as vaccines
should be pyrogen-free in addition to having the capability of inducing neutralizing
antibodies. Expression clones and conditions that facilitate the production of C. botulinum C
fragment protein for utilization as a vaccine were developed.

The example involved: (a) determination of pyrogen content of the pMBot protein;
(b) generation of C. botulinum C fragment protein free of the MBP; (c) expression of C.
botulinum C fragment protein using various expression vectors; and (d) purification of soluble
C. botulinum C fragment protein substantially free of significant endotoxin contamination.

a) Determination Of The Pyrogen Content Of The pMBot 
Protein 

In order to use a recombinant antigen as a vaccine in humans or other animals, the
antigen preparation must be shown to be free of pyrogens. The most significant pyrogen
present in preparations of recombinant proteins produced in gram-negative bacteria, such as E.
coli, is endotoxin [F.C. Pearson, Pyrogens: endotoxins, LAL testing and depyrogenation,
(1985) Marcel Dekker, New York, pp. 23-56]. To evaluate the utility of the pMBot protein
as a vaccine candidate, the endotoxin content in MBP fusion proteins was determined.

The endotoxin content of recombinant protein samples was assayed utilizing the
Limulus assay (LAL kit; Associates of Cape Cod) according to the manufacturer's
instructions. Samples of affinity-purified pMal-c protein and pMA1870-2680 were found to
contain high levels of endotoxin [>50,000 EU/mg protein; EU (endotoxin unit)]. This
suggested that MBP- or toxin A repeat-containing fusions with the botulinal C fragment
should also contain high levels of endotoxin. Accordingly, removal of endotoxin from
affinity-purified pMal-c and pMBot protein preparations was attempted as follows.

Samples of pMal-c and pMBot protein were depyrogenated with polymyxin to
determine if the endotoxin could be easily removed. The following amount of protein was
treated: 29 ml at 4.8 OD280/ml for pMal-c and 19 ml at 1.44 OD280/ml for pMBot. The
protein samples were dialyzed extensively against PBS and mixed in a 50 ml tube (Falcon)
with 0.5 ml PBS-equilibrated polymyxin B (Affi-Prep Polymyxin, BioRad). The samples
were allowed to mix by rotating the tubes overnight at 4°C. The polymyxin was pelleted by
centrifugation for 30 min in a bench top centrifuge at maximum speed (approximately 2000 x
g) and the supernatant was removed. The recovered protein (in the supernatant) was
quantified by OD280, and the endotoxin activity was assayed by LAL. In both cases only
approximately 1/3 of the input protein was recovered and the polymyxin-treated protein
retained significant endotoxin contamination (approximately 7000 EU/mg of pMBot).

The depyrogenation experiment was repeated using an independently purified pMal-c
protein preparation and similar results were obtained. From these studies it was concluded
that significant levels of endotoxin copurify with these MBP fusion proteins using the
amylose resin. Furthermore, this endotoxin cannot be easily removed by polymyxin
treatment.

These results suggest that the presence of the MBP sequences on the fusion protein
complicated the removal of endotoxin from preparations of the pMBot protein.

b) Generation Of C. botulinum C Fragment Protein Free Of
The MBP

It was demonstrated that the pMBot fusion protein could not be easily purified from
contaminating endotoxin in section a) above. The ability to produce a pyrogen-free (e.g.,
endotoxin-free) preparation of soluble botulinal C fragment protein free of the MBP tag was
next investigated. The pMBot expression construct was designed to facilitate purification of
the botulinal C fragment from the MBP tag by cleavage of the fusion protein by utilizing an
engineered Factor Xa cleavage site present between the MBP and the botulinal C fragment.
The Factor Xa cleavage was performed as follows.

Factor Xa (New England Biolabs) was added to the pMBot protein (using a 0.1-1.0%
Factor Xa/pMBot protein ratio) in a variety of buffer conditions [e.g., PBS-NaCl (PBS
containing 0.5 M NaCl), PBS-NaCl containing 0.2% Tween 20, PBS, PBS containing 0.2%
Tween 20, PBS-C (PBS containing 2 mM CaCl2), PBS-C containing either 0.1 or 0.5%
Tween 20, PBS-C containing either 0.1 or 0.5% NP-40, PBS-C containing either 0.1 or 0.5%
Triton X-100, PBS-C containing 0.1% sodium deoxycholate, PBS-C containing 0.1% SDS].
The Factor Xa digestions were incubated for 12-72 hrs at room temperature.

The extent of cleavage was assessed by Western blot or Coomassie blue staining of
proteins following electrophoresis on denaturing SDS-PAGE gels, as described in Example
22. Cleavage reactions (and control samples of uncleaved pMBot protein) were centrifuged
for 2 min in a microfuge to remove insoluble protein prior to loading the samples on the gel.
The Factor Xa treated samples were compared with uncleaved, uncentrifuged pMBot samples
on the same gel. The results of this analysis are summarized below.

1) Most (about 90%) pMBot protein could be removed by centrifugation, even
when uncleaved control samples were utilized. This indicated that the pMBot fusion protein
was not fully soluble (i.e., it exists as a suspension rather than as a solution). [This result was
consistent with the observation that most affinity-purified pMBot protein precipitates after
long term storage (>2 weeks) at 4°C. Additionally, the majority (i.e., 75%) of induced
pMBot protein remains in the pellet after sonication and clarification of the induced E. coli.
Resuspension of these insoluble pellets in PBS followed by sonication results in partial
solubilization of the insoluble pMBot protein in the pellets.]

2) The portion of pMBot protein that is fully in solution (about 10% of pMBot
protein) is completely cleaved by Factor Xa, but the cleaved (released) botulinal C fragment is
relatively insoluble such that only the cleaved MBP remains fully in solution.

3) None of the above reaction conditions enhanced solubility without also 
reducing effective cleavage. Conditions that effectively solubilized the cleaved botulinal C 
fragment were not identified. 

4) The use of 0.1% SDS in the buffer used for Factor Xa cleavage enhanced the 
solubility of the pMBot protein (all of pMBot protein was soluble). However, the presence of 
the SDS prevented any cleavage of the fusion protein with Factor Xa.

5) Analysis of pelleted protein from the cleavage reactions indicated that both full
length pMBot (i.e., uncleaved) and cleaved botulinal C fragment protein precipitated during
incubation.

These results demonstrate that purification of soluble botulinal C fragment protein after
cleavage of the pMBot fusion protein is complicated by the insolubility of both the pMBot
protein and the cleaved botulinal C fragment protein.

c) Expression Of C. botulinum C Fragment Using Various
Expression Vectors

In order to determine if the solubility of the botulinal C fragment was enhanced by
expressing the C fragment protein as a native protein, an N-terminal His-tagged protein or as
a fusion with glutathione-S-transferase (GST), alternative expression plasmids were
constructed. These expression constructs were generated utilizing the methodologies described
in Example 22. Figure 27 provides a schematic representation of the vectors described below.

In Figure 27, the following abbreviations are used. pP refers to the pET23 vector.
pHIS refers to the pETHisa vector. pBlue refers to the pBluescript vector. pM refers to the
pMAL-c vector and pG refers to the pGEX3T vector (described in Example 11). The solid
black lines represent C. botulinum C fragment gene sequences; the solid black ovals represent
the MBP; the hatched ovals represent GST; "HHHHH" represents the poly-histidine tag. In
Figure 27, when the name for a restriction enzyme appears inside parentheses, this indicates
that the restriction site was destroyed during construction. An asterisk appearing with the
name for a restriction enzyme indicates that this restriction site was recreated at a cloning
junction.


i) Construction Of pPBot 

In order to express the C. botulinum C fragment as a native (i.e., non-fused) protein,
the pPBot plasmid (shown schematically in Figure 27) was constructed as follows. The C
fragment sequences present in pAlterBot (Example 22) were removed by digestion of
pAlterBot with NcoI and HindIII. The NcoI/HindIII C fragment insert was ligated to
pETHisa vector (described in Example 18b) which was digested with NcoI and HindIII. This
ligation creates an expression construct in which the NcoI-encoded methionine of the botulinal
C fragment is the initiator codon and directs expression of the native botulinal C fragment.
The ligation products were used to transform competent BL21(DE3)pLysS cells (Novagen).
Recombinant clones were identified by restriction mapping.

ii) Construction Of pHisBot 

In order to express the C. botulinum C fragment containing a poly-histidine tag at the
amino-terminus of the recombinant protein, the pHisBot plasmid (shown schematically in
Figure 27) was constructed as follows. The NcoI/HindIII botulinal C fragment insert from
pAlterBot was ligated into the pETHisa vector which was digested with NdeI and HindIII.
The NcoI (on the C fragment insert) and NdeI (on the pETHisa vector) sites were filled in
using the Klenow fragment prior to ligation; these sites were then blunt end ligated (the NdeI
site was regenerated at the clone junction as predicted). The ligation products were used to
transform competent BL21(DE3)pLysS cells and recombinant clones were identified by
restriction mapping.

The resulting pHisBot clone expresses the botulinal C fragment protein with a
histidine-tagged N-terminal extension having the following sequence:
MetGlyHisHisHisHisHisHisHisHisHisHisSerSerGlyHisIleGluGlyArgHisMetAla (SEQ ID NO:24); the amino
acids encoded by the botulinal C fragment gene are underlined and the vector encoded amino
acids are presented in plain type. The nucleotide sequence present in the pETHisa vector
which encodes the pHisBot fusion protein is listed in SEQ ID NO:25. The amino acid
sequence of the pHisBot protein is listed in SEQ ID NO:26.

iii) Construction Of pGBot

The botulinal C fragment protein was expressed as a fusion with the glutathione-S-
transferase protein by constructing the pGBot plasmid (shown schematically in Figure 27).
This expression construct was created by cloning the NotI/SalI C fragment insert present in
pBlueBot (Example 22) into the pGEX3T vector which was digested with SmaI and XhoI.
The NotI site (present on the botulinal fragment) was made blunt prior to ligation using the
Klenow fragment. The ligation products were used to transform competent BL21 cells.

Each of the above expression constructs was tested by restriction digestion to confirm
the integrity of the constructs.

Large scale (1 liter) cultures of pPBot [BL21(DE3)pLysS host], pHisBot
[BL21(DE3)pLysS host] and pGBot (BL21 host) were grown in 2X YT medium and induced
(using IPTG to 0.8-1.0 mM) for 3 hrs as described in Example 22. Total, soluble and
insoluble protein preparations were prepared from 1 ml aliquots of each large scale culture
[Williams et al. (1994), supra] and analyzed by SDS-PAGE. No obvious induced band was
detectable in the pPBot or pHisBot samples by Coomassie staining, while a prominent
insoluble band of the anticipated MW was detected in the pGBot sample. Soluble lysates of
the pGBot large scale (resuspended in PBS) or pHisBot large scale [resuspended in Novagen
1X binding buffer (5 mM imidazole, 0.5 M NaCl, 20 mM Tris-HCl, pH 7.9)] cultures were
prepared and used to affinity purify soluble affinity-tagged protein as follows.

The pGBot lysate was affinity purified on a glutathione-agarose resin (Pharmacia)
exactly as described in Smith and Corcoran [Current Protocols in Molecular Biology,
Supplement 28 (1994), pp. 16.7.1-16.7.7]. The pHisBot protein was purified on the His-Bind
resin (Novagen) utilizing the His-bind buffer kit (Novagen) exactly as described by the
manufacturer.

Samples from the purification of both the pGBot and pHisBot proteins (including
uninduced, induced, total, soluble, and affinity-purified eluted protein) were resolved on SDS-
PAGE gels. Following electrophoresis, proteins were analyzed by Coomassie staining or by
Western blot detection utilizing a chicken anti-C. botulinum Type A toxoid antibody (as
described in Example 22).

These studies showed that the pGBot protein was almost entirely insoluble under the
utilized conditions, while the pHisBot protein was soluble. Affinity purification of the
pHisBot protein on this first attempt was inefficient, both in terms of yield (most of the
immunoreactive botulinal protein did not bind to the His-bind resin) and purity (the botulinal
protein was estimated to comprise approximately 20% of the total eluted protein).

d) Purification Of Soluble C. botulinum C Fragment Protein
Substantially Free Of Endotoxin Contamination

The above studies showed that the pHisBot protein was expressed in E. coli as a
soluble protein. However, the affinity purification of this protein on the His-bind resin was
very inefficient. In order to improve the affinity purification of the soluble pHisBot protein
(in terms of both yield and purity), an alternative poly-histidine binding affinity resin (Ni-
NTA resin; Qiagen) was utilized. The Ni-NTA resin was reported to have a superior binding
affinity (Kd = 1 x 10^[illegible] at pH 8.0; Qiagen user manual) relative to the His-bind resin.

A soluble lysate (in Novagen 1X binding buffer) from an induced 1 liter 2X YT
culture was prepared as described above. Briefly, the culture of pHisBot [BL21(DE3)pLysS
host] was grown at 37°C to an OD600 of 0.7 in 1 liter of 2X YT medium containing 100
µg/ml ampicillin, 34 µg/ml chloramphenicol and 0.2% glucose. Protein expression was
induced by the addition of IPTG to 1 mM. Three hours after the addition of the IPTG, the
cells were cooled for 15 min in an ice water bath and then centrifuged 10 min at 5000 rpm in
a JA-10 rotor (Beckman) at 4°C. The pellets were resuspended in a total volume of 40 ml
Novagen 1X binding buffer (5 mM imidazole, 0.5 M NaCl, 20 mM Tris-HCl, pH 7.9),
transferred to two 35 ml Oakridge tubes and frozen at -70°C for at least 1 hr. The tubes
were thawed and the cells were lysed by sonication (4 X 20 second bursts using a Branson
Sonifier 450 with a power setting of 6-7) on ice. The suspension was clarified by
centrifugation for 20 min at 9,000 rpm (10,000 x g) in a JA-17 rotor (Beckman).

The soluble lysate was brought to 0.1% NP40 and then was batch absorbed to 7 ml of
a 1:1 slurry of Ni-NTA resin:binding buffer by stirring for 1 hr at 4°C. The slurry was
poured into a column having an internal diameter of 1 or 2.5 cm (BioRad). The column was
then washed sequentially with 15 ml of Novagen 1X binding buffer containing 0.1% NP40,
15 ml of Novagen 1X binding buffer, 15 ml wash buffer (60 mM imidazole, 0.5 M NaCl, 20
mM Tris-HCl, pH 7.9) and 15 ml NaHPO4 wash buffer (50 mM NaHPO4, pH 7.0, 0.3 M
NaCl, 10% glycerol). The bound protein was eluted by protonation of the resin using elution
buffer (50 mM NaHPO4, pH 4.0, 0.3 M NaCl, 10% glycerol). The eluted protein was stored
at 4°C.

Samples of total, soluble and eluted protein were resolved by SDS-PAGE. Protein
samples were prepared for electrophoresis as described in Example 22b. Duplicate gels were
stained with Coomassie blue to visualize the resolved proteins and C. botulinum type A toxin-
reactive protein was detected by Western blot analysis as described in Example 22b. A
representative Coomassie stained gel is shown in Figure 28. In Figure 28, the following
samples were loaded on the 12.5% acrylamide gel. Lanes 1-4 contain respectively total
protein, soluble protein, soluble protein present in the flow-through of the Ni-NTA column
and affinity-purified pHisBot protein (i.e., protein released from the Ni-NTA resin by
protonation). Lane 5 contains high molecular weight protein markers (BioRad).

The purification of pHisBot protein resulted in a yield of 7 mg of affinity purified
protein from a 1 liter starting culture of BL21(DE3)pLysS cells harboring the pHisBot
plasmid. The yield of purified pHisBot protein represented approximately 0.4% of the total
soluble protein in the induced culture. Analysis of the purified pHisBot protein by SDS-
PAGE revealed that at least 90-95% of the protein was present as a single band (Figure 28) of
the predicted MW (50 kD). This 50 kD protein band was immunoreactive with anti-C.
botulinum type A toxin antibodies. The extinction coefficient of the protein preparation was
determined to be 1.4 (using the Pierce BCA assay) or 1.45 (using the Lowry assay) OD280 per
1 mg/ml solution.
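
The yield and extinction-coefficient figures above lend themselves to a short worked calculation. The sketch below is illustrative only: the function names and the example OD280 reading of 0.7 are assumptions introduced here, while the 7 mg yield, the 0.4% share and the 1.4 OD280 per 1 mg/ml coefficient are the values quoted in the text.

```python
def total_soluble_protein_mg(purified_mg: float, percent_of_soluble: float) -> float:
    """Total soluble protein implied by a purified yield and its percentage share."""
    return purified_mg / (percent_of_soluble / 100.0)


def concentration_mg_per_ml(od280: float, od_per_mg_per_ml: float = 1.4) -> float:
    """Convert an OD280 reading to mg/ml using the stated extinction coefficient."""
    return od280 / od_per_mg_per_ml


# 7 mg of purified pHisBot is stated to be about 0.4% of the total soluble protein.
implied_total_mg = total_soluble_protein_mg(7.0, 0.4)

# Hypothetical reading: an eluate with OD280 = 0.7 (not a value from the source).
example_conc = concentration_mg_per_ml(0.7)

print(f"implied total soluble protein: {implied_total_mg:.0f} mg per liter of culture")
print(f"OD280 of 0.7 corresponds to about {example_conc:.2f} mg/ml")
```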

Samples of pH neutralized eluted pHisBot protein were resolved on a KB 803 HPLC
column (Shodex). Although His-tagged proteins are retained by this sizing column (perhaps
due to the inherent metal binding ability of the proteins), the relative mobility of the pHisBot
protein was consistent with that expected for a non-aggregated protein in solution. Most of
the induced pHisBot protein was determined to be soluble under the growth and solubilization
conditions utilized above (i.e., greater than 90% of the pHisBot protein was found to be
soluble as judged by comparison of the levels of pHisBot protein seen in total and soluble
protein samples prepared from BL21(DE3)pLysS cells containing the pHisBot plasmid).
SDS-PAGE analysis of samples obtained after centrifugation, extended storage at -20°C, and
at least 2 cycles of freezing and thawing detected no protein loss (due to precipitation),
indicating that the pHisBot protein is soluble in the elution buffer (i.e., 50 mM NaHPO4, pH
4.0, 0.3 M NaCl, 10% glycerol).

Determination of endotoxin contamination in the affinity purified pHisBot preparation
(after pH neutralization) using the LAL assay (Associates of Cape Cod) detected no
significant endotoxin contamination. The assay was performed using the endpoint
chromogenic method (without diazo-coupling) according to the manufacturer's instructions.
This method can detect concentrations of endotoxin greater than or equal to 0.03 EU/ml (EU
refers to endotoxin units). The LAL assay was run using 0.5 ml of a solution comprising 0.5
mg pHisBot protein in 50 mM NaHPO4, pH 7.0, 0.3 M NaCl, 10% glycerol; 30-60 EU were
detected in the 0.5 ml sample. Therefore, the affinity purified pHisBot preparation contains
60-120 EU/mg of protein. FDA Guidelines for the administration of parenteral drugs require
that a composition to be administered to a human contain less than 5 EU/kg body weight (the
average human body weight is 70 kg; therefore up to 349 EU can be delivered in a
parenteral dose). Because very small amounts of protein are administered in a vaccine
preparation (generally in the range of 10-500 µg of protein), administration of affinity
purified pHisBot containing 60-120 EU/mg protein would result in delivery of only a small
percentage of the permissible endotoxin load. For example, administration of 10-500 µg of
purified pHisBot to a 70 kg human, where the protein preparation contains 60 EU/mg protein,
results in the introduction of only 0.6 to 30 EU [i.e., 0.2 to 8.6% of the maximum allowable
endotoxin burden per parenteral dose (less than 5 EU/kg body weight)].
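
The endotoxin arithmetic in the preceding paragraph can be reproduced with a few lines of Python. This is a minimal sketch under stated assumptions: the function names are hypothetical, and the ceiling is computed as 70 kg x 5 EU/kg = 350 EU, whereas the text quotes 349 EU because the limit is "less than" 5 EU/kg.

```python
def endotoxin_per_mg(eu_detected: float, protein_mg: float) -> float:
    """Endotoxin content (EU/mg) from an LAL reading on a known protein mass."""
    return eu_detected / protein_mg


def dose_burden(dose_ug: float, eu_per_mg: float,
                body_weight_kg: float = 70.0, limit_eu_per_kg: float = 5.0):
    """Return (EU delivered, percent of the per-dose ceiling) for one vaccine dose."""
    eu_delivered = (dose_ug / 1000.0) * eu_per_mg
    ceiling_eu = body_weight_kg * limit_eu_per_kg  # 350 EU; the text quotes 349 ("less than")
    return eu_delivered, 100.0 * eu_delivered / ceiling_eu


# 30-60 EU detected in a sample containing 0.5 mg pHisBot protein.
low = endotoxin_per_mg(30, 0.5)
high = endotoxin_per_mg(60, 0.5)
print(f"{low:.0f}-{high:.0f} EU/mg")

# Doses at the two ends of the 10-500 ug range, at 60 EU/mg.
for dose_ug in (10, 500):
    eu, pct = dose_burden(dose_ug, eu_per_mg=60)
    print(f"{dose_ug} ug -> {eu:.1f} EU ({pct:.1f}% of ceiling)")
```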

The above results demonstrate that endotoxin (LPS) does not copurify with the
pHisBot protein using the above purification scheme. Preparations of recombinantly produced
pHisBot protein containing lower levels of endotoxin (less than or equal to 2 EU/mg
recombinant protein) may be produced by washing the Ni-NTA column with wash buffer until
the OD280 returns to baseline levels (i.e., until no more UV-absorbing material comes off of
the column).

The above results illustrate a method for the production and purification of soluble
botulinal C fragment protein substantially free of endotoxin.

EXAMPLE 25

Optimization Of The Expression And Purification Of pHisBot Protein 

The results shown in Example 24d demonstrated that the pHisBot protein is an
excellent candidate for use as a vaccine as it could be produced as a soluble protein in E. coli
and could be purified free of pyrogen activity. In order to optimize the expression and
purification of the pHisBot protein, a variety of growth and purification conditions were
tested.

a) Growth Parameters
i) Host Strains

The influence of the host strain utilized upon the production of soluble pHisBot
protein was investigated. A large scale purification of pHisBot was performed [as described
in Example 24d above] using the BL21(DE3) host (Novagen) rather than the
BL21(DE3)pLysS host. The deletion of the pLysS plasmid in the BL21(DE3) host yielded
higher levels of expression due to de-repression of the plasmid's T7-lac promoter. However,
the yield of affinity-purified soluble recombinant protein was very low (approximately 600
µg/liter culture) when purified under conditions identical to those described in Example 24d
above. This result was due to the fact that expression in the BL21(DE3) host yielded very
high level expression of the pHisBot protein as insoluble inclusion bodies as shown by SDS-
PAGE analysis of protein prepared from induced BL21(DE3) cultures (Figure 29, lanes 1-7,
described below). These results demonstrate that the pHisBot protein is not inherently toxic
to E. coli cells and can be expressed to high levels using the appropriate promoter/host
combination.

Figure 29 shows a Coomassie blue stained SDS-PAGE gel (12.5% acrylamide) onto
which extracts prepared from BL21(DE3) cells containing the pHisBot plasmid were loaded.
Each lane was loaded with 2.5 µl protein sample mixed with 2.5 µl of 2X SDS sample buffer.
The samples were handled as described in Example 22b. The following samples were applied
to the gel. Lanes 1-7 contain protein isolated from the BL21(DE3) host. Lanes 8-14 contain
proteins isolated from the BL21(DE3)pLysS host. Total protein was loaded in Lanes 1, 2, 4,
6, 8, 10 and 12. Soluble protein was loaded in Lanes 3, 5, 7, 9, 11 and 13. Lane 1 contains
protein from uninduced host cells. Lanes 2-13 contain protein from host cells induced for 3
hours. IPTG was added to a final concentration of 0.1 mM (Lanes 6-7), 0.3 mM (Lanes 4-5)
or 1.0 mM (Lanes 2, 3, 8-13). The cultures were grown in LB broth (Lanes 8-9), 2X YT
broth (Lanes 10-11) or Terrific broth (Lanes 1-7, 12-13). The pHisBot protein seen in Lanes
3, 5 and 7 is insoluble protein which spilled over from Lanes 2, 4 and 6, respectively. High
molecular weight protein markers (BioRad) were loaded in Lane 14.
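
Because the lane assignments for Figure 29 are spread across several sentences, it can help to collect them in one lookup. The following Python sketch encodes only what the legend above states; the function name and dictionary keys are illustrative, and lane 14 is treated as the marker lane (the legend also lists lanes 8-14 under the BL21(DE3)pLysS host, so that choice is an assumption).

```python
TOTAL_LANES = {1, 2, 4, 6, 8, 10, 12}


def describe_lane(lane: int) -> dict:
    """Return the sample description for one lane of the Figure 29 gel."""
    if lane == 14:
        return {"lane": 14, "content": "high molecular weight markers"}
    if not 1 <= lane <= 13:
        raise ValueError("Figure 29 has lanes 1-14")

    host = "BL21(DE3)" if lane <= 7 else "BL21(DE3)pLysS"
    fraction = "total" if lane in TOTAL_LANES else "soluble"

    # Lane 1 is uninduced; all other sample lanes were induced for 3 hours.
    if lane == 1:
        iptg_mm = None
    elif lane in (6, 7):
        iptg_mm = 0.1
    elif lane in (4, 5):
        iptg_mm = 0.3
    else:
        iptg_mm = 1.0

    if lane in (8, 9):
        medium = "LB"
    elif lane in (10, 11):
        medium = "2X YT"
    else:
        medium = "Terrific broth"

    return {"lane": lane, "host": host, "fraction": fraction,
            "induced": lane != 1, "iptg_mM": iptg_mm, "medium": medium}


for lane in range(1, 15):
    print(describe_lane(lane))
```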

A variety of expression conditions were tested to determine if the BL21(DE3) host
could be utilized to express soluble pHisBot protein at suitably high levels (i.e., about 10
mg/ml). The conditions altered were temperature (growth at 37 or 30°C), culture medium
(2X YT, LB or Terrific broth) and inducer levels (0.1, 0.3 or 1.0 mM IPTG). All
combinations of these variables were tested and the induction levels and solubility were then
assessed by SDS-PAGE analysis of total and soluble extracts [prepared from 1 ml samples as
described in Williams et al. (1994), supra].
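
"All combinations of these variables" describes a full factorial design. A minimal Python sketch, assuming nothing beyond the three variables and their levels listed above (the key names are illustrative), enumerates the resulting cultures:

```python
from itertools import product

# Levels stated in the text for each variable.
temperatures_c = (37, 30)
media = ("2X YT", "LB", "Terrific broth")
iptg_mm = (0.1, 0.3, 1.0)

# One dictionary per culture condition in the full factorial design.
conditions = [
    {"temperature_C": t, "medium": m, "iptg_mM": c}
    for t, m, c in product(temperatures_c, media, iptg_mm)
]

print(len(conditions))  # 2 temperatures x 3 media x 3 inducer levels
for condition in conditions:
    print(condition)
```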

All cultures were grown in 15 ml tubes (Falcon #2057). All culture medium was
prewarmed overnight at the appropriate temperature and was supplemented with 100 µg/ml
ampicillin and 0.2% glucose. Terrific broth contains 12 g/l bacto-tryptone, 24 g/l bacto-yeast
extract and 100 ml/l of a solution comprising 0.17 M KH2PO4, 0.72 M K2HPO4. Cultures
were grown in an incubator on a rotating wheel (to ensure aeration) to an OD600 of
approximately 0.4, and induced by the addition of IPTG. In all cases, high level expression
of insoluble pHisBot protein was observed, regardless of temperature, medium or inducer
concentration.

The effect of varying the concentration of IPTG upon 2X YT cultures grown at 23°C
was then investigated. IPTG was added to a final concentration of either 1 mM, 0.1 mM,
0.05 mM or 0.01 mM. At this temperature, similar levels of pHisBot protein were induced in
the presence of either 1 or 0.1 mM IPTG; these levels of expression were lower than that
observed at higher temperatures. Induced protein levels were reduced at 0.05 mM IPTG and
absent at 0.01 mM IPTG (relative to 1.0 and 0.1 mM IPTG inductions at 23°C). However,
no conditions were observed in which the induced pHisBot protein was soluble in this host.
Thus, although expression levels are superior in the BL21(DE3) host (as compared to the
BL21(DE3)pLysS host), conditions that facilitate the production of soluble protein in this host
could not be identified.

These results demonstrate that production of soluble pHisBot protein was achieved
using the BL21(DE3)pLysS host in conjunction with the T7-lac promoter.

ii) Effect Of Varying Temperature, Medium And
IPTG Concentration And Length Of Induction

The effect of growing the host cells in various mediums upon the expression of
recombinant botulinal protein from the pHisBot expression construct [in the BL21(DE3)pLysS
host] was investigated. BL21(DE3)pLysS cells containing the pHisBot plasmid were grown
in either LB, 2X YT or Terrific broth at 37°C. The cells were induced using 1 mM IPTG for
a 3 hr induction period. Expression of pHisBot protein was found to be the highest when the
cells were grown in 2X YT broth (see Figure 29, lanes 8-13).

The cells were then grown at 30°C in 2X YT broth; the concentration of IPTG was
varied (1.0, 0.3 or 0.1 mM) and the length of induction was either 3 or 5 hours.
Expression of pHisBot protein was similar at all 3 inducer concentrations utilized and the
levels of induced protein were higher after a 5 hr induction as compared to a 3 hr induction.

Using the conditions found to be optimal for the expression of pHisBot protein, a large
scale culture was grown in order to provide sufficient material for a large scale purification of
the pHisBot protein. Three 1 liter cultures were grown in 2X YT medium containing 100
µg/ml ampicillin, 34 µg/ml chloramphenicol and 0.2% glucose. The cultures were grown at
30°C and were induced with 1.0 mM IPTG for a 5 hr period. The cultures were harvested
and a soluble lysate was prepared as described in Example 18. A large scale purification
was performed as described in Example 24d with the exception that the soluble lysate
was batch absorbed for 3 hours rather than for 1 hour. The final yield was 13 mg pHisBot
protein/liter culture. The pHisBot protein represented 0.75% of the total soluble protein.
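
The effect of the optimized conditions can be expressed as a fold change relative to the first purification in Example 24d (7 mg/liter, about 0.4% of total soluble protein). The sketch below is illustrative only; the dictionary layout and labels are assumptions, and the numbers are the ones quoted in the two examples.

```python
# Yields quoted in Example 24d and in this example (labels are illustrative).
runs = {
    "Example 24d": {"yield_mg_per_liter": 7.0, "percent_of_soluble": 0.4},
    "Example 25": {"yield_mg_per_liter": 13.0, "percent_of_soluble": 0.75},
}

baseline = runs["Example 24d"]
optimized = runs["Example 25"]

# Fold change in purified yield and in the share of total soluble protein.
yield_fold = optimized["yield_mg_per_liter"] / baseline["yield_mg_per_liter"]
share_fold = optimized["percent_of_soluble"] / baseline["percent_of_soluble"]

print(f"yield per liter: {yield_fold:.2f}-fold")
print(f"share of soluble protein: {share_fold:.2f}-fold")
```

The two ratios are close to each other, which is what one would expect if the total soluble protein per liter was similar in both runs.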

The above results demonstrate growth conditions under which soluble pHisBot protein
is produced (i.e., use of the BL21(DE3)pLysS host, 2X YT medium, 30°C, 1.0 mM IPTG for
5 hours).

b) Optimization Of Purification Parameters

For optimization of purification conditions, large scale cultures (3 X 1 liter) were
grown at 30°C and induced with 1 mM IPTG for 5 hours as described above. The cultures
were pooled, distributed to centrifuge bottles, cooled and pelleted as described in Example
24d. The cell pellets were frozen at -70°C until used. Each cell pellet represented 1/3 of a
liter starting culture and individual bottles were utilized for each optimization experiment
described below. This standardized the input bacteria used for each experiment, such that the
yields of affinity purified pHisBot protein could be compared between different optimization
experiments.

i) Binding Specificity (pH Protonation)

A lysate of pHisBot culture was prepared in PBS (pH 8.0) and applied to a 3 ml Ni-
NTA column equilibrated in PBS (pH 8.0) using a flow rate of 0.2 ml/min (3-4 column
volumes/hr) using an Econo chromatography system (BioRad). The column was washed with
PBS (pH 8.0) until the absorbance (OD280) of the elute was at baseline levels. The flow rate
was then increased to 2 ml/min and the column was equilibrated in PBS (pH 7.0). A pH
gradient (pH 7.0 to 4.0 in PBS) was applied in order to elute the bound pHisBot protein from
the column. Fractions were collected and aliquots were resolved on SDS-PAGE gels. The
PAGE gels were subjected to Western blotting and the pHisBot protein was detected using a
chicken anti-C. botulinum Type A toxoid antibody as described in Example 22.

From the Western blot analysis it was determined that the pHisBot protein begins to
elute from the Ni-NTA column at pH 6.0. This is consistent with the predicted elution of a
His-tagged protein monomer at pH 5.9.

These results demonstrate that the pH at which the pHisBot protein is protonated
(released) from Ni-NTA resin in PBS buffer is pH 6.0.

ii) Binding Specificity (Imidazole Competition) 

In order to define purification conditions under which the native E. coli proteins could
be removed from the Ni-NTA column while leaving the pHisBot protein bound to the
column, the following experiment was performed. A lysate of pHisBot culture was prepared
in 50 mM NaHPO4, 0.5 M NaCl, 8 mM imidazole (pH 7.0). This lysate was applied to a 3
ml Ni-NTA column equilibrated in 50 mM NaHPO4, 0.5 M NaCl (pH 7.0) using an Econo
chromatography system (BioRad). A flow rate of 0.2 ml/min (3-4 column volumes/hr) was
utilized. The column was washed with 50 mM NaHPO4, 0.5 M NaCl (pH 7.0) until the
absorbance of the elute returned to baseline. The flow rate was then increased to 2 ml/min.

The column was eluted using an imidazole step gradient [in 50 mM NaHPO4, 0.5 M
NaCl (pH 7.0)]. Elution steps were 20 mM, 40 mM, 60 mM, 80 mM, 100 mM, 200 mM, 1.0
M imidazole, followed by a wash using 0.1 mM EDTA (to strip the nickel from the column
and remove any remaining protein). In each step, the wash was continued until the OD280
returned to baseline. Fractions were resolved on SDS-PAGE gels, Western blotted, and
pHisBot protein detected using a chicken anti-C. botulinum Type A toxoid antibody as
described in Example 22. Duplicate gels were stained with Coomassie blue to detect eluted
protein in each fraction.

The results of the PAGE analysis showed that most of the non-specifically binding
bacterial protein was removed by the 20 mM imidazole wash, with the remaining bacterial
proteins being removed in the 40 and 60 mM imidazole washes. The pHisBot protein began
to elute at 100 mM imidazole and was quantitatively eluted in 200 mM imidazole.
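The elution profile just described can be written down as data to make the usable wash range explicit. The following is a minimal sketch, not part of the disclosed protocol: the labels "host", "none" and "target" are shorthand introduced here, and the 80 mM step is marked "none" only because no eluted protein is reported for it.

```python
# Imidazole step (mM) -> what was reported to come off the Ni-NTA column.
# Labels are illustrative shorthand, not terms from the protocol.
STEPS = [
    (20, "host"),     # most non-specifically bound bacterial protein
    (40, "host"),     # remaining bacterial protein
    (60, "host"),     # remaining bacterial protein
    (80, "none"),     # nothing reported for this step
    (100, "target"),  # pHisBot begins to elute
    (200, "target"),  # pHisBot quantitatively eluted
]


def wash_window(steps):
    """Return (lowest, highest) wash step that clears host protein
    while staying below the first step at which the target elutes."""
    last_host = max(mm for mm, what in steps if what == "host")
    first_target = min(mm for mm, what in steps if what == "target")
    highest_safe = max(mm for mm, _ in steps if mm < first_target)
    return last_host, highest_safe


low, high = wash_window(STEPS)
print(f"wash window: {low}-{high} mM imidazole")
```

Under these assumptions the window is 60-80 mM imidazole, which agrees with the 60 mM imidazole binding buffer used in the integrated protocol of this Example.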

These results precisely defined the window of imidazole wash stringency that
optimally removes E. coli proteins from the column while specifically retaining the pHisBot
protein in this buffer. These results provided conditions under which the pHisBot protein can
be purified free of contaminating host proteins.

iii) Purification Buffers And Optimized
Purification Protocols

A variety of purification parameters were tested during the development of an
optimized protocol for batch purification of soluble pHisBot protein. The results of these
analyses are summarized below.

Batch purifications were performed (as described in Example 24d) using several
buffers to determine if alternative buffers could be utilized for binding of the pHisBot protein
to the Ni-NTA column. It was determined that quantitative binding of pHisBot protein to the
Ni-NTA resin was achieved in either Tris-HCl (pH 7.9) or NaHPO4 (pH 8.0) buffers.
Binding of the pHisBot protein in NaHPO4 buffer was not inhibited using 5 mM, 8 mM or 60
mM imidazole. Quantitative elution of bound pHisBot protein was obtained in buffers
containing 50 mM NaHPO4, 0.3 M NaCl (pH 3.5-4.0), with or without 10% glycerol.
However, quantitation of soluble affinity purified pHisBot protein before and after a freeze
thaw (following several weeks storage of the affinity purified eluate at -20°C) revealed that
94% of the protein was recovered using the glycerol-containing buffer, but only 68% of the
protein was recovered when the buffer lacking glycerol was employed. This demonstrates
that glycerol enhanced the solubility of the pHisBot protein in this low pH buffer when the
eluted protein was stored at freezing temperatures (e.g., -20°C). Neutralization of pH by
addition of NaH2PO4 buffer did not result in obvious protein precipitation.

It was determined that quantitative binding of pHisBot protein using the batch format
occurred after 3 hrs (Figure 30), but not after 1 hr of binding at 4°C (the resin was stirred
during binding). Figure 30 depicts a Coomassie blue stained SDS-PAGE gel (7.5%
acrylamide) containing samples of proteins isolated during the purification of pHisBot protein
from lysate prepared from the BL21(DE3)pLysS host. Each lane was loaded with 5 µl of
protein sample mixed with 5 µl of 2X sample buffer and processed as described in Example
22b. Lane 1 contains high molecular weight protein markers (BioRad). Lanes 2 and 3
contain protein eluted from the Ni-NTA resin. Lane 4 contains soluble protein after a 3 hr
batch incubation with the Ni-NTA resin. Lanes 5 and 6 contain soluble and total protein,
respectively. Figure 30 demonstrates that the pHisBot protein is completely soluble [compare
Lanes 5 and 6, which show that a similar amount of the 50 kD pHisBot protein is seen in
both; if a substantial amount (greater than 20%) of the pHisBot protein were partially
insoluble in the host cell, more pHisBot protein would be seen in lane 6 (total protein) as
compared to lane 5 (soluble protein)]. Figure 30 also demonstrates that the pHisBot protein is
completely removed from the lysate after batch absorption with the Ni-NTA resin for 3 hours
(compare Lanes 4 and 5).

The reported high affinity interaction of the Ni-NTA resin with His-tagged proteins
(Kd = 1 x 10^-13 at pH 8.0) suggested that it should be possible to manipulate the resin-protein
complexes without significant release of the bound protein. Indeed, it was determined that
after the recombinant protein was bound to the Ni-NTA resin, the resin-pHisBot protein
complex was highly stable and remained bound following repeated rounds of centrifugation of
the resin for 2 min at 1600 x g. When this centrifugation step was performed in a 50 ml tube
(Falcon), a tight resin pellet formed. This allowed the removal of spent soluble lysate by
pouring off the supernatant followed by resuspension of the pellet in wash buffer. Further
washes can be performed by centrifugation. The ability to perform additional washes permits
the development of protocols for batch absorption of large volumes of lysate with removal of
the lysate being performed simply by centrifugation following binding of the recombinant
protein to the resin.

A simplified, integrated purification protocol was developed as follows. A soluble
lysate was made by resuspending the induced cell pellet in binding buffer [50 mM NaHPO4,
0.5 M NaCl, 60 mM imidazole (pH 8.0)], sonicating 4 x 20 sec and centrifuging for 20 min
at 10,000 x g. NP40 was added to 0.1% and Ni-NTA resin (equilibrated in binding buffer)
was added. Eight milliliters of a 1:1 slurry (resin:binding buffer) was used per liter of
starting culture. The mixture was stirred for 3 hrs at 4°C. The slurry was poured into a
column having a 1 cm internal diameter (BioRad), washed with binding buffer containing
0.1% NP40, then binding buffer until baseline was established (these steps may alternatively
be performed by centrifugation of the resin, resuspension in binding buffer containing NP40
followed by centrifugation and resuspension in binding buffer). Imidazole was removed by
washing the resin with 50 mM NaHPO4, 0.3 M NaCl (pH 7.0). Protein bound to the resin was
eluted using the same buffer (50 mM NaHPO4, 0.3 M NaCl) having a reduced pH
(pH 3.5-4.0).

A pilot purification was performed following this protocol and yielded 18 mg/liter
affinity-purified pHisBot. The pHisBot protein was greater than 90% pure as estimated by
Coomassie staining of an SDS-PAGE gel. This represents the highest observed yield of
soluble affinity-purified pHisBot protein and this protocol eliminates the need for separate
imidazole-containing binding and wash buffers. In addition to providing a simplified and
efficient protocol for the affinity purification of recombinant pHisBot protein, the above
results provide a variety of purification conditions under which pHisBot protein can be
isolated.
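For planning purposes, the resin and yield figures of the integrated protocol (8 ml of a 1:1 slurry per liter of culture, 18 mg/liter in the pilot run) can be scaled to other culture volumes. The sketch below assumes simple linear scaling, which the text does not verify; the function name and dictionary keys are hypothetical.

```python
def batch_plan(culture_liters, slurry_ml_per_liter=8.0, yield_mg_per_liter=18.0):
    """Scale the batch-binding step linearly with culture volume.

    Linear scaling is an assumption of this sketch; the defaults are the
    figures reported for the pilot purification.
    """
    slurry_ml = culture_liters * slurry_ml_per_liter
    return {
        "slurry_ml": slurry_ml,
        "settled_resin_ml": slurry_ml / 2.0,  # a 1:1 slurry is half resin by volume
        "expected_mg": culture_liters * yield_mg_per_liter,
    }


for liters in (1.0, 2.5):
    print(liters, batch_plan(liters))
```

The expected yield is an upper-end estimate, since 18 mg/liter is described as the highest yield observed.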

EXAMPLE 26 

The pHisBot Protein Is An Effective Immunogen

In Example 23 it was demonstrated that neutralizing antibodies are generated in mouse
serum after nasal immunization with the pMBot protein. However, the pMBot protein was
found to copurify with significant amounts of endotoxin which could not be easily removed.
The pHisBot protein, in contrast, could be isolated free of significant endotoxin contamination,
making pHisBot a superior candidate for vaccine production. To further assess the suitability
of pHisBot as a vaccine, the immunogenicity of the pHisBot protein was determined and a
comparison of the relative immunogenicity of pMBot and pHisBot proteins in mice was
performed as follows.

Two groups of eight BALBc mice were immunized with either pMBot protein or
pHisBot protein using Gerbu GMDP adjuvant (CC Biotech). pMBot protein (in PBS
containing 10 mM maltose) or pHisBot protein (in 50 mM NaHPO4, 0.3 M NaCl, 10%
glycerol, pH 4.0) was mixed with Gerbu adjuvant and used to immunize mice. Each mouse
received an IP injection of 100 µl antigen/adjuvant mix (50 µg antigen plus 1 µg adjuvant) on
day 0. Mice were boosted as described above with the exception that the route of
administration was IM on day 14 and 28. The mice were bled on day 77 and anti-C.
botulinum Type A toxoid titers were determined using serum collected from individual mice
in each group (as described in Example 23). The results are shown in Table 41.

TABLE 41

The results shown above in Table 41 demonstrate that both the pMBot and pHisBot
proteins are immunogenic in mice as 100% of the mice (8/8) in each group seroconverted
from non-immune to immune status. The results also show that the average titer of anti-C.
botulinum Type A toxoid IgG is 2-3 fold higher after immunization with the pHisBot protein
relative to immunization with the pMBot protein. This suggests that the pHisBot protein may
be a superior immunogen to the pMBot protein.

EXAMPLE 27 

Immunization With The Recombinant
pHisBot Protein Generates Neutralizing Antibodies

The results shown in Example 26 demonstrated that both the pHisBot and pMBot
proteins were capable of inducing high titers of anti-C. botulinum type A toxoid-reactive
antibodies in immunized hosts. The ability of the immune sera from mice immunized with
either the pHisBot or pMBot proteins to neutralize botulinum type A toxoid in vivo was
determined using the mouse neutralization assay described in Example 23b.






The two groups of eight BALBc mice immunized with either pMBot protein or
pHisBot protein in Example 26 were boosted again one week after the bleeding on day 77.
The boost was performed by mixing pMBot protein (in PBS containing 10 mM maltose) or
pHisBot protein (in 50 mM NaHPO4, 0.3 M NaCl, 10% glycerol, pH 4.0) with Gerbu
adjuvant as described in Example 26. Each mouse received an IP injection of 100 µl
antigen/adjuvant mix (50 µg antigen plus 1 µg adjuvant). The mice were bled 6 days after
this boost and the serum from mice within a group was pooled. Serum from preimmune mice
was also collected (this serum is the same serum described in the footnote to Table 41).

The presence of neutralizing antibodies in the pooled or preimmune serum was
detected by challenging mice with 5 LD50 units of type A toxin mixed with 100 µl of pooled
serum. The challenge was performed by mixing (per mouse to be injected) 100 µl of serum
from each pool with 100 µl of purified type A toxin standard (50 LD50/ml prepared as
described in Example 23b) and 500 µl of gel-phosphate. The mixtures were incubated for 30
min at room temperature with occasional mixing. Each of four mice was injected IP with
the mixtures (0.7 ml/mouse). The mice were observed for signs of botulism for 72 hours.
Mice receiving toxin mixed with serum from mice immunized with either the pHisBot or
pMBot proteins showed no signs of botulism intoxication. In contrast, mice receiving
preimmune serum died in less than 24 hours.
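The per-mouse challenge mixture can be checked arithmetically: the three components should total the 0.7 ml injected, and the toxin aliquot should carry the stated 5 LD50. The sketch below uses only the volumes and toxin concentration given above; the function and parameter names are hypothetical.

```python
def challenge_mix(serum_ul=100.0, toxin_ul=100.0,
                  toxin_ld50_per_ml=50.0, diluent_ul=500.0):
    """Return (injection volume in ml, LD50 units per mouse) for one mixture."""
    total_ml = (serum_ul + toxin_ul + diluent_ul) / 1000.0
    ld50_per_mouse = toxin_ld50_per_ml * toxin_ul / 1000.0
    return total_ml, ld50_per_mouse


volume_ml, ld50 = challenge_mix()
print(f"{volume_ml} ml per mouse carrying {ld50} LD50")
```

With the stated inputs the mixture comes to 0.7 ml containing 5 LD50, matching the challenge dose and injection volume reported in the text.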

These results demonstrate that antibodies capable of neutralizing C. botulinum type A
toxin are induced when either of the recombinant C. botulinum C fragment proteins pHisBot
or pMBot are used as immunogens.


EXAMPLE 28 

Cloning And Expression Of The C Fragment Of C. botulinum
Serotype A Toxin In E. coli Utilizing A Native Gene Fragment

In Example 22 above, a synthetic gene was used to express the C fragment of C.
botulinum serotype A toxin in E. coli. The synthetic gene replaced non-preferred (i.e., rare)
codons present in the C fragment gene with codons which are preferred by E. coli. The
synthetic gene was generated because it has been reported that genes which have a high A/T
content (such as most clostridial genes) create expression difficulties in E. coli and yeast.
Furthermore, LaPenotiere et al. suggested that problems encountered with the stability
(non-fusion constructs) and solubility (MBP fusion constructs) of the C fragment of C. botulinum
serotype A toxin when expressed in E. coli were most likely due to the extreme A/T richness
of the native C. botulinum serotype A toxin gene sequences (LaPenotiere et al., supra).

In this example, it was demonstrated that successful expression of the C fragment of
the C. botulinum type A toxin gene in E. coli does not require the elimination of rare codons
(i.e., there is no need to use a synthetic gene). This example involved a) the cloning of the
native C fragment of the C. botulinum serotype A toxin gene and construction of an
expression vector and b) a comparison of the expression and purification yields of C.
botulinum serotype A C fragments derived from native and synthetic expression vectors.

a) Cloning Of The Native C Fragment Of The C. botulinum
Serotype A Toxin Gene And Construction Of An Expression
Vector

The serotype A toxin gene was cloned from C. botulinum genomic DNA using PCR
amplification. The following primer pair was employed:
5'-CGCCATGGCTAGATTATTATCTACATTTAC-3' (5' primer, NcoI site underlined; SEQ ID NO:29) and
5'-GCAAGCTTCTTGACAGACTCATGTAG-3' (3' primer, HindIII site underlined; SEQ ID
NO:30). C. botulinum type A strain was obtained from the American Type Culture Collection
(ATCC#10397) and grown under anaerobic conditions in Terrific broth medium. High
molecular-weight C. botulinum DNA was isolated as described in Example 11. The integrity
and yield of genomic DNA was assessed by comparison with a serial dilution of uncut lambda
DNA after electrophoresis on an agarose gel.

The gene fragment was cloned by PCR utilizing a proofreading thermostable DNA
polymerase (native Pfu polymerase). PCR amplification was performed using the above
primer pair in a 50 µl reaction containing 10 mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM
MgCl2, 200 µM each dNTP, 0.2 µM each primer, and 50 ng C. botulinum genomic DNA.
Reactions were overlaid with 100 µl mineral oil, heated to 94°C for 4 min, 0.5 µl native Pfu
polymerase (Stratagene) was added, and thirty cycles comprising 94°C for 1 min, for 2
min, 72°C for 2 min were carried out followed by 10 min at 72°C. An aliquot (10 µl) of the
reaction mixture was resolved on an agarose gel and the amplified native C fragment gene
was gel purified using the Prep-A-Gene kit (BioRad) and ligated to pCRScript vector DNA
(Stratagene). Recombinant clones were isolated and confirmed by restriction digestion, using
standard recombinant molecular biology techniques [Sambrook et al. (1989), supra]. In
addition, the sequence of approximately 300 bases located at the 5' end of the C fragment
coding region was obtained using standard DNA sequencing methods. The sequence
obtained was identical to that of the published sequence.
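Because the cloning strategy relies on the restriction sites built into the two PCR primers, it is useful to confirm that each primer carries its site. The sketch below is illustrative only: the primer strings are the sequences of SEQ ID NO:29 and NO:30 as read from the text above (any transcription error would carry over), the recognition sequences CCATGG (NcoI) and AAGCTT (HindIII) are the standard ones, and the helper name is hypothetical.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"NcoI": "CCATGG", "HindIII": "AAGCTT"}

# Primer sequences as read from the text (5' to 3'); treat as a transcription.
PRIMERS = {
    "5' primer (SEQ ID NO:29)": "CGCCATGGCTAGATTATTATCTACATTTAC",
    "3' primer (SEQ ID NO:30)": "GCAAGCTTCTTGACAGACTCATGTAG",
}


def find_sites(seq):
    """Map each enzyme whose site occurs in seq to the 0-based start index."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


for label, seq in PRIMERS.items():
    print(label, len(seq), "nt", find_sites(seq))
```

Each primer carries exactly one of the two sites, placed two bases in from its 5' end, so the amplified fragment can be excised as an NcoI-HindIII fragment.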

An expression vector containing the native C. botulinum serotype A C fragment gene
was created by ligation of the NcoI-HindIII fragment containing the C fragment gene from the
pCRScript clone to NheI-HindIII restricted pETHisa vector (Example 18b). The NcoI and
NheI sites were filled in using the Klenow enzyme prior to ligation; these sites were thus
blunt-end ligated together. The resulting construct was termed pHisBotA (native). pHisBotA
(native) expresses the C. botulinum serotype A C fragment with a his-tagged N terminal
extension which has the following sequence:

Met(nyHisHislIisIlisllisHisHisHisHisHisSerSerGlyFIis//c^G///G^^^^^^^ (SEQ ID 

NO:24), where the underlining represents amino acids encoded by the C. botulinum C
fragment gene (this N terminal extension contains the recognition site for Factor Xa protease,
shown in italics, which can be employed to remove the polyhistidine tract from the
N-terminus of the fusion protein). The pHisBot (native) construct expresses the identical protein
as the pHisBot construct (Ex. 24c; hereinafter the pHisBotA) which contains the synthetic
gene.

The predicted DNA sequence encoding the native C. botulinum serotype A C fragment
gene contained within pHisBotA (native) is listed in SEQ ID NO:31 [the start of translation
(ATG) is located at nucleotides 108-110 and the stop of translation (TAA) is located at
nucleotides 1494-1496 in SEQ ID NO:31] and the corresponding amino acid sequence is listed
in SEQ ID NO:26 (i.e., the same amino acid sequence as that produced by pHisBotA
containing synthetic gene sequences).
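The start and stop coordinates allow a quick consistency check against the roughly 50 kD band reported for this protein. The sketch below reads the coordinates as ATG at nucleotides 108-110 and TAA at 1494-1496, and assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from this document; the function name is hypothetical.

```python
def orf_summary(atg_start, stop_start, avg_residue_da=110.0):
    """Return (number of residues, approximate mass in kDa) for an open
    reading frame given the first nucleotide of the ATG and of the stop codon.

    avg_residue_da is a rule-of-thumb average, not a measured value.
    """
    coding_nt = stop_start - atg_start
    if coding_nt % 3:
        raise ValueError("start and stop codons are not in the same frame")
    residues = coding_nt // 3
    return residues, residues * avg_residue_da / 1000.0


residues, kda = orf_summary(108, 1494)
print(f"{residues} residues, about {kda:.1f} kDa")
```

The two coordinates are in frame and give 462 codons, or about 51 kDa under the stated assumption, in line with the 50 kD protein seen on the gels.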

b) Comparison Of The Expression And Purification Yields Of
C. botulinum Serotype A C Fragments Derived From Native
And Synthetic Expression Vectors

Recombinant plasmids containing either the native or the synthetic C. botulinum
serotype A C fragment genes were transformed into E. coli strain BL21(DE3) pLysS and
protein expression was induced in 1 liter shaker flask cultures. Total protein extracts were
isolated, resolved on SDS-PAGE gels and C. botulinum C fragment protein was identified by
Western analysis utilizing a chicken anti-C. botulinum serotype A toxoid antiserum as
described in Example 22.

Briefly, 1 liter (2XYT + 100 µg/ml ampicillin and 34 µg/ml chloramphenicol) cultures
of bacteria harboring either the pHisBotA (synthetic) or pHisBotA (native) plasmids in the
BL21(DE3) pLysS strain were induced to express recombinant protein by addition of IPTG to
1 mM. Cultures were grown at 30-32°C. IPTG was added when the cell density reached an
OD600 of 0.5-1.0 and the induced protein was allowed to accumulate for 3-4 hrs after induction.
The cells were cooled for 15 min in an ice water bath and then centrifuged for 10 min
at 5000 rpm in a JA10 rotor (Beckman) at 4°C. The cell pellets were resuspended in a total
volume of 40 mls 1X binding buffer (40 mM imidazole, 0.5 M NaCl, 50 mM NaPO4, pH
8.0), transferred to two 50 ml Oakridge tubes and frozen at -70°C for at least 1 hr. The tubes
were then thawed and the cells were lysed by sonication (using four successive 20 second
bursts) on ice. The suspension was clarified by centrifugation for 20-30 min at 9,000 rpm
(10,000 x g) in a JA-17 rotor. The soluble lysate was batch absorbed to 7 ml of a 1:1 slurry of
NiNTA resin:binding buffer by stirring 2-4 hr at 4°C. The slurry was centrifuged for 1 min
at 500 x g in a 50 ml tube (Falcon), resuspended in 5 mls binding buffer and poured into a 2.5 cm
diameter column (BioRad). The column was attached to a UV monitor (ISCO) and the
column was washed with binding buffer until a baseline was established. Imidazole was
removed by washing with 50 mM NaPO4, 0.3 M NaCl, 10% glycerol, pH 7.0 and bound
protein was eluted using 50 mM NaPO4, 0.3 M NaCl, 10% glycerol, pH 3.5-4.0.

The eluted proteins were stored at 4°C. Samples of total, soluble, and eluted proteins
were resolved by SDS-PAGE. Protein samples were prepared for electrophoresis by mixing
1 µl total (T) or soluble (S) protein with 4 µl PBS and 5 µl 2X SDS-PAGE sample buffer, or
5 µl eluted (E) protein and 5 µl 2X SDS-PAGE sample buffer. The samples were heated to
95°C for 5 min, then cooled and 5 or 10 µls were loaded on 12.5% SDS-PAGE gels. Broad
range molecular weight protein markers (BioRad) were also loaded to allow the MW of the
identified fusion proteins to be estimated. After electrophoresis, protein was detected either
generally by staining gels with Coomassie blue, or specifically, by blotting to nitrocellulose
for Western blot detection of specific immunoreactive protein.

For Western blot analysis, the gels were blotted, and protein transfer was confirmed by
Ponceau S staining as described in Example 22. After blocking the blots for 1 hr at room
temperature in blocking buffer (PBST and 5% milk), 10 ml of a 1/500 dilution of an anti-C.
botulinum toxin A IgY PEG prep (Ex. 3) in blocking buffer was added and the blots were
incubated for an additional hour at room temperature. The blots were washed and developed
using a rabbit anti-chicken alkaline phosphatase conjugate (Boehringer Mannheim) as the
secondary antibody as described in Ex. 22. This analysis detected C. botulinum toxin
A-reactive proteins in the pHisBotA (native and synthetic) protein samples (corresponding to the
predicted full length proteins identified by Coomassie staining).

A gel containing proteins expressed from the pHisBot and pHisBot (native) constructs
during various stages of purification and stained with Coomassie blue is shown in Figure 31.
In Figure 31, lanes 1-4 and 9 contain proteins expressed by the pHisBotA construct (i.e., the
synthetic gene) and lanes 5-8 contain proteins expressed by the pHisBotA (native) construct.
Lanes 1 and 5 contain total protein extracts; lanes 2 and 6 contain soluble protein extracts;
lanes 3 and 7 contain proteins which flowed through the NiNTA columns; lanes 4, 8 and 9
contain protein eluted from the NiNTA columns and lane 10 contains molecular weight
markers.

The above purification resulted in a yield of 3 mg (native gene) or 11 mg (synthetic
gene) of affinity purified protein from a 1 liter starting culture, of which at least 90-95% of
the protein was a single band of the predicted MW (50 kD) and immunoreactivity for
recombinant C. botulinum serotype A C fragment protein. Other than the level of expression,
no difference was observed between the native and the synthetic gene expression systems.

These results demonstrate that soluble C. botulinum serotype A C fragment protein can
be expressed in E. coli and purified utilizing either native or synthetic gene sequences.

EXAMPLE 29 

Generation Of Neutralizing Antibodies Using A Recombinant
C. botulinum Serotype A C Fragment Protein Containing A Six Residue His-Tag

In Example 27, neutralizing antibodies were generated utilizing the pHisBotA protein,
which contains a histidine-tagged N-terminal extension comprising 10 histidine residues. To
determine if the generation of neutralizing antibodies is dependent on the presence of this
particular his-tag, a protein containing a shorter N-terminal extension (comprising 6 histidine
residues) was produced and tested for the ability to generate neutralizing antibodies. This
example involved a) the cloning and expression of the p6HisBotA(syn) protein and b) the
generation and characterization of hyperimmune serum.

a) Cloning And Expression Of The p6HisBotA(syn) Protein

The p6HisBotA(syn) construct was generated as described below; the term "syn"
designates the presence of synthetic gene sequences. This construct expresses the C fragment
of the C. botulinum serotype A toxin with a histidine-tagged N terminal extension having the
following sequence: MetHisHisHisHisHisHis MetAla (SEQ ID NO:32); the amino acids
encoded by the botulinal C fragment gene are underlined and the vector encoded amino acids
are presented in plain type.

6XHis oligonucleotides [5'-TATGCATCACCATCACCATCA-3' (SEQ ID NO:33) and
5'-CATGTGATGGTGATGGTGATGCA-3' (SEQ ID NO:34)] were annealed as follows. One
microgram of each oligonucleotide was mixed in a total of 20 µl 1X reaction buffer 2 (NEB)
and the mixture was heated at 70°C for 5 min and then incubated at 42°C for 5 min. The
annealed oligonucleotides were then ligated with gel purified NdeI/HindIII cleaved pET23b
(T7 promoter) or pET21b (T7lac promoter) DNA and the gel purified NcoI/HindIII C.
botulinum serotype A C fragment synthetic gene fragment derived from pAlterBot (Ex. 22).
Recombinant clones were isolated and confirmed by restriction digestion. The DNA sequence
encoding the 6X his-tagged BotA protein contained within p6HisBotA(syn) is listed in SEQ
ID NO:35. The amino acid sequence of the p6XHisBotA protein is listed in SEQ ID NO:36.
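The three-way ligation works only if the annealed oligonucleotide pair presents an NdeI-compatible end on one side and an NcoI-compatible end on the other. The sketch below checks this from the two oligonucleotide sequences as read from the text (SEQ ID NO:33 and NO:34). It is an illustration rather than part of the disclosed method: the helper names are hypothetical, and the 5'-TA (NdeI) and 5'-CATG (NcoI) overhangs are the standard sticky ends for those enzymes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
# Only the codons that occur in the tag region are needed for this check.
CODONS = {"ATG": "Met", "CAT": "His", "CAC": "His"}


def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def annealed_overhangs(top, bottom):
    """Return (5' overhang of top, 5' overhang of bottom, paired region),
    assuming the two oligos pair along one contiguous stretch."""
    bottom_rc = revcomp(bottom)  # bottom strand read in the top-strand direction
    for start in range(len(top)):
        core = top[start:]
        if bottom_rc.startswith(core):
            return top[:start], revcomp(bottom_rc[len(core):]), core
    raise ValueError("oligos do not anneal")


def translate(seq):
    """Translate whole codons of seq using the small table above."""
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, len(seq) - len(seq) % 3, 3))


TOP = "TATGCATCACCATCACCATCA"        # SEQ ID NO:33 as read from the text
BOTTOM = "CATGTGATGGTGATGGTGATGCA"   # SEQ ID NO:34 as read from the text

top_oh, bottom_oh, core = annealed_overhangs(TOP, BOTTOM)
print("top 5' overhang:", top_oh, "| bottom 5' overhang:", bottom_oh)
# Reading frame from the ATG of the NdeI site through the ATG of the NcoI site.
print(translate(TOP[1:] + bottom_oh))
```

The duplex exposes a TA overhang on one end and a CATG overhang on the other, and the joined reading frame encodes Met followed by six His residues and then the Met contributed by the NcoI site, consistent with the N-terminal extension of SEQ ID NO:32.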

The resulting recombinant p6XHisBotA plasmid was transformed into the BL21(DE3)
pLysS strain, and 1 liter cultures were grown, induced and harvested as described in Example
28. His-tagged protein was purified as described in Example 28, with the following
modifications. The binding buffer (BB) contained 5 mM imidazole rather than 40 mM
imidazole and NP40 was added to the soluble lysate to a final concentration of 0.1%. The
bound material was washed on the column with BB until the baseline was established, then
the column was washed successively with BB+20 mM imidazole and BB+40 mM imidazole.
The column was eluted as described in Example 28.

In the case of the pET23-derived expression system, high level expression of insoluble
6HisBotA protein was induced. The pET21-derived vector expressed lower levels of soluble
protein that bound the NiNTA resin and eluted in the 40 mM imidazole wash rather than
during the low pH elution. These results (i.e., low level expression of a soluble protein) are
consistent with the results obtained with pHisBotA protein (Ex. 25); the pHisBotA construct,
like the pET21-derived vector, contains the T7lac rather than T7 promoter.

The 6HisBotA protein thus elutes under less stringent conditions than the 10X
histidine-containing pHisBot protein (100-200 mM imidazole; Ex. 25), presumably due to the
reduction in the length of the his-tag. The eluted protein was of the predicted size [i.e.,
slightly reduced in comparison to pHisBotA protein].

b) Generation And Characterization Of Hyperimmune Serum

Eight BALBc mice were immunized with purified 6HisBotA protein using Gerbu
GMDP adjuvant (CC Biotech). The 40 mM imidazole elution was mixed with Gerbu
adjuvant and used to immunize mice. Each mouse received a subcutaneous injection of 100
µl antigen/adjuvant mix (12 µg antigen + 1 µg adjuvant) on day 0. Mice were subcutaneously
boosted as above on day 14 and bled on day 28. Control mice received pHisBotB protein
(prepared as described in Ex. 35 below) in Gerbu adjuvant.

Anti-C. botulinum serotype A toxoid titers were determined in serum from individual
mice from each group using the ELISA described in Example 23a with the exception that the
initial testing serum dilution was 1:100 in blocking buffer containing 0.5% Tween 20,
followed by serial 5-fold dilutions into this buffer. The results of the ELISA demonstrated
that seroconversion (relative to control mice) occurred in all 8 mice.

The ability of the anti-C. botulinum serotype A C fragment antibodies present in serum
from the immunized mice to neutralize native C. botulinum type A toxin was tested using the
mouse neutralization assay described in Example 23b. The amount of neutralizing antibodies
present in the serum of the immunized mice was determined using serum antibody titrations.
The various serum dilutions (0.01 ml) were mixed with 5 LD50 units of C. botulinum type A
toxin and the mixtures were injected IP into mice. The neutralizations were performed in
duplicate. The mice were then observed for signs of botulism for 4 days. Undiluted serum
was found to protect 100% of the injected mice while the 1:10 diluted serum did not. This
corresponds to a neutralization titer of 0.05-0.5 IU/ml.
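The titer range follows from the assay volumes. The sketch below reproduces the arithmetic under one assumption that is not stated in this passage: that 1 IU of type A antitoxin neutralizes 10,000 mouse LD50, the conventional definition. The function name and parameters are hypothetical.

```python
# Assumption: 1 IU of type A antitoxin neutralizes 10,000 mouse LD50.
LD50_PER_IU = 10_000.0


def iu_per_ml(ld50_neutralized, serum_ml, dilution_fold=1.0):
    """Neutralizing activity per ml of neat serum implied by one assay point.

    dilution_fold is 1 for undiluted serum and 10 for a 1:10 dilution.
    """
    neat_serum_ml = serum_ml / dilution_fold
    return ld50_neutralized / neat_serum_ml / LD50_PER_IU


lower = iu_per_ml(5, 0.01, dilution_fold=1)    # undiluted serum protected
upper = iu_per_ml(5, 0.01, dilution_fold=10)   # 1:10 dilution did not protect
print(f"titer between {lower:.2f} and {upper:.1f} IU/ml")
```

Protection by 0.01 ml of undiluted serum sets the lower bound at 0.05 IU/ml, and the failure of the 1:10 dilution sets the upper bound at 0.5 IU/ml.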

These results demonstrate that neutralizing antibodies were induced when the
6HisBotA protein was utilized as the immunogen. Furthermore, these results demonstrate that
seroconversion and the generation of neutralizing antibodies do not depend on the specific
N terminal extension present on the recombinant C. botulinum type A C fragment proteins.

EXAMPLE 30

Construction Of Vectors For The Expression Of His-Tagged
C. botulinum Type A Toxin C Fragment Protein Using The Synthetic Gene

A number of expression vectors were constructed which contained the synthetic C.
botulinum type A toxin C fragment gene. These constructs vary as to the promoter (T7 or
T7lac) and repressor elements (lacIq) present on the plasmid. The T7 promoter is a stronger
promoter than is the T7lac promoter. The various constructs provide varying expression
levels and varying levels of plasmid stability. This example involved a) the construction of
expression vectors containing the synthetic C. botulinum type A C fragment gene and b) the
determination of the expression level achieved using plasmids containing either the kanamycin
resistance or the ampicillin resistance genes in small scale cultures.

a) Construction Of Expression Vectors Containing The
Synthetic C. botulinum Type A C Fragment Gene

Expression vectors containing the synthetic C. botulinum type A C fragment gene were
engineered to utilize the kanamycin resistance rather than the ampicillin resistance gene. This
was done for several reasons including concerns regarding the presence of residual ampicillin
in recombinant protein derived from plasmids containing the ampicillin resistance gene. In
addition, ampicillin resistant plasmids are more difficult to maintain in culture; the
β-lactamase secreted by cells containing ampicillin resistant plasmids rapidly degrades
extracellular ampicillin, allowing the growth of plasmid-negative cells.

A second altered feature of the expression vectors is the inclusion of the lacIq gene in the
plasmid. This repressor lowers expression from lac regulated promoters (the chromosomally
located, lactose regulated T7 polymerase gene and the plasmid located T7lac promoter). This
down regulates uninduced protein expression and can enhance the stability of recombinant cell
lines. The final alteration to the vectors is the inclusion of either the T7 or T7lac promoters
that drive high or moderate level expression of recombinant protein, respectively.

The expression plasmids were constructed as follows. In all cases, the protein
expressed is the pHisBotA(syn) protein previously described, and the only difference between
constructs is the alteration of the various regulatory elements described above.

i) Construction Of pHisBotA(syn) kan T7lac

The pHisBotA(syn) kan T7lac construct was made by inserting the SapI/XhoI fragment
containing the C. botulinum type A C fragment from pHisBotA(syn) into pET24 digested with
SapI/XhoI (Novagen; fragment contains kan gene and origin of replication). The desired
construct was selected for kanamycin resistance and confirmed by restriction digestion.

ii) Construction Of pHisBotA(syn) kan lacIq T7lac

The pHisBotA(syn) kan lacIq T7lac construct was made by inserting the XbaI/HindIII
fragment containing the C. botulinum type A C fragment from pHisBotA(syn) kan T7lac into
the pET24a vector digested with XbaI/HindIII. The resulting construct was confirmed by
restriction digestion.

iii) Construction Of pHisBotA(syn) kan lacIq T7

The pHisBotA(syn) kan lacIq T7 construct was made by inserting the XbaI/HindIII
fragment containing the C. botulinum type A C fragment from pHisBotA(syn) kan lacIq T7lac
into XbaI/HindIII-digested pHisBotB(syn) kan lacIq T7 (described in Ex. 37c below). The
resulting construct was confirmed by restriction digestion.

b) Determination Of The Expression Level Achieved Using
Plasmids Containing Either The Kanamycin Resistance Or
The Ampicillin Resistance Genes In Small Scale Cultures

One liter cultures of pHisBotA(syn) kan T7lac/BL21(DE3)pLysS and pHisBotA(syn)
amp T7lac/BL21(DE3)pLysS [this is the previously designated pHisBotA(syn) construct] were
grown, induced and his-tagged proteins were purified as described in Example 28. No
differences in yield or protein integrity/purity were observed.

These results demonstrate that the antigen induction levels from expression constructs
were not affected by the choice of ampicillin versus kanamycin antibiotic resistance genes.

EXAMPLE 31

Fermentation Of Cells Expressing Recombinant Botulinal Proteins 

a) Fermentation Culture Of Cells Expressing Recombinant
Botulinal Proteins

Fermentation cultures were grown under the following conditions which were
optimized for growth of the BL21(DE3) strains containing pET derived expression vectors.
An overnight 1 liter feeder culture was prepared by inoculating 1 liter of media (in a 2L
shaker flask) with a fresh colony grown on an LB kan plate. The feeder culture contained:
600 mls nitrogen source [20 gm yeast extract (BBL) and 40 gm tryptone (BBL)/600 mls], 200
mls 5X fermentation salts (per liter: 48.5 gm K2HPO4, 12 gm NaH2PO4·H2O, 5 gm NH4Cl,
2.5 gm NaCl), 180 mls dH2O, 20 mls 20% glucose, 2 mls 1 M MgSO4, 5 mls 0.05 M CaCl2
and 4 mls of a 10 mg/ml kanamycin stock. All solutions were sterilized by autoclaving,
except the kanamycin stock which was filter sterilized.

An aliquot (5 ml) of the feeder culture broth was removed prior to inoculation, and
grown for 2 days at 37°C as a culture broth sterility control. Growth was not observed in this
control culture in any of the fermentations performed.

The inoculated feeder culture was grown for 12-15 hrs (ON) at 30-37°C. Care was
taken to prevent oversaturation of this culture. The saturated feeder culture was added to 10 L
of fermentation media in a fermenter (BioFlo IV, New Brunswick Scientific, Edison, NJ) as
follows. The fermenter was sterilized 120 min at 121°C with dH2O. The sterile water was
removed, and fermentation media added as follows: 6 liters nitrogen source, 2 liters 5X
fermentation salts, 2 liters 2% glucose, 20 mls 1 M MgSO4, 50 mls 0.05 M CaCl2, 2.5-3.5
mls Macol P 400 antifoam (PPG Industries Inc., Gurnee, IL), 40 mls 10 mg/ml kanamycin and
10 mls trace elements (8 gm FeSO4·7H2O, 2 gm MnSO4·H2O, 2 gm AlCl3·6H2O, 0.8 gm
CoCl2·6H2O, 0.4 gm ZnSO4·7H2O, 0.4 gm Na2MoO4·2H2O, 0.2 gm CuCl2·2H2O, 0.2 gm
NiCl2, 0.1 gm H3BO3/200 mls 5 M HCl). All solutions were sterilized by autoclaving, except
the kanamycin stock which was filter sterilized. Fermentation media was prewarmed to 37°C
before the addition of the feeder culture.
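
As a cross-check on the fermenter recipe above, the final concentrations of glucose,
kanamycin and fermentation salts can be worked out from the listed volumes. The sketch
below is illustrative only: the function and dictionary names are not part of the original
protocol, the antifoam volume is taken at the midpoint of the stated 2.5-3.5 mls range, and
the 1 liter feeder culture added afterwards is ignored.

```python
def final_concentration(stock_conc, stock_volume_ml, total_volume_ml):
    """Concentration after diluting a stock into the final volume (units of the stock)."""
    return stock_conc * stock_volume_ml / total_volume_ml


# Fermenter charge in mls, as listed in the text (antifoam: assumed midpoint of 2.5-3.5).
fermenter_components_ml = {
    "nitrogen source": 6000,
    "5X fermentation salts": 2000,
    "2% glucose": 2000,
    "1 M MgSO4": 20,
    "0.05 M CaCl2": 50,
    "antifoam": 3,
    "10 mg/ml kanamycin": 40,
    "trace elements": 10,
}
total_ml = sum(fermenter_components_ml.values())

glucose_pct = final_concentration(2.0, 2000, total_ml)      # percent
kan_ug_per_ml = final_concentration(10_000, 40, total_ml)   # 10 mg/ml = 10,000 ug/ml
salts_fold = final_concentration(5.0, 2000, total_ml)       # "X" strength

print(f"total volume: {total_ml} ml")
print(f"glucose: {glucose_pct:.2f}%  kanamycin: {kan_ug_per_ml:.1f} ug/ml  "
      f"salts: {salts_fold:.2f}X")
```

Under these assumptions the fermentation medium comes out at roughly 0.4% glucose, about
40 µg/ml kanamycin and approximately 1X salts, which matches the proportions used for the
feeder culture.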

After the addition of the feeder culture, the culture was fermented at 37°C, 400 rpm
agitation, and 10 l/min air sparging. The DO2 control was set to 20% PID and dissolved
oxygen levels were controlled by increasing the rate of agitation from 400-850 rpm under
DO2 control. DO2 levels were maintained at greater than or equal to 20% throughout the
entire fermentation. When agitation levels reached 500-600 rpm the temperature was lowered
to 30°C to reduce the oxygen consumption rate. Culture growth was continued until
endogenous carbon sources were depleted. In these fermentations, glucose was depleted first
[monitored with a glucose monitoring kit (Sigma)], followed by assimilation of acetate and
other acidic carbons [monitored using an acetate test kit (Boehringer Mannheim)]. During the
assimilation phase, the pH rose from 6.6-6.8 (starting pH) to 7.4-7.5, at which time the bulk
of the remaining carbon source was depleted. This was signaled by a drop in agitation rate
(from a maximum of 700-800 rpm) and a rise in DO2 levels >30%. This corresponds to an
OD600 reading of 18-20/ml. At this point a fed batch mode was initiated, in which a feed
solution of 50% glucose was added at a rate of approximately 4 gm glucose/liter/hr. The pH
was adjusted to 7.0 by the addition of 25% H3PO4 (approximately 60 mls). Culture growth
was continued and reached peak oxygen consumption within the next 3 hrs of growth (while
the remaining residual non-glucose carbon sources were assimilated). This phase is
characterized by a slow increase in pH, and air sparging was increased to 15 L/min to keep
the maximum rpm below 850. Once the residual acidic carbon sources are depleted the
agitation rate decreases to 650-750 rpm and the pH begins to drop. pH control was
maintained at 7.0 PID by regulated pump addition of a sterile 4 M NaOH solution which was
consumed at a steady rate for the remainder of the fermentation. Growth was continued at
30°C, and the cultures were grown linearly at a growth rate of 4-7 OD600 units/hr, to at least
81.5 OD600 units/ml (>30 g/l dry cell weight) without induction. Antifoam (a 1:1 dilution with
filter sterilized 100% ethanol) was added as necessary throughout the fermentation to prevent
foaming.
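
The glucose feed rate quoted above (approximately 4 gm glucose/liter/hr from a 50% stock)
can be converted into a pump setting. The following is a minimal sketch under two
assumptions that the text does not state: that "50% glucose" means 50 gm per 100 ml
(weight/volume), and that the working culture volume is supplied by the operator. The
function name is hypothetical.

```python
def glucose_feed_ml_per_hr(rate_g_per_l_hr, culture_volume_l, feed_pct_w_v):
    """Pump rate (ml/hr) of a w/v glucose stock that delivers the target gm/liter/hr."""
    grams_per_hr = rate_g_per_l_hr * culture_volume_l
    grams_per_ml = feed_pct_w_v / 100.0  # e.g. 50% w/v = 0.5 gm/ml
    return grams_per_hr / grams_per_ml


# Example: 10 liters of culture fed at 4 gm/liter/hr from a 50% (w/v) stock.
print(glucose_feed_ml_per_hr(4.0, 10.0, 50.0), "ml/hr")
```

For a 10 liter culture this works out to 80 ml of feed solution per hour; a larger working
volume (for example after addition of the feeder culture) scales the pump rate
proportionally.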

During the fed batch mode, glucose was assimilated immediately (concentration in
media consistently less than 0.1 gm/liter) and acetate was not produced in significant levels by
the pET plasmid/BL21(DE3) cell lines tested (approximately 1 gm/liter at end of
fermentation; this is lower than that observed in harvests from shaker flask cultures utilizing
the same strains). This was fortuitous, since high levels of acetate have been shown to inhibit
induction levels in a variety of expression systems. The above described conditions were
found to be highly reproducible between fermentations and utilizing different expression
plasmids. As a result, glucose and acetate level monitoring were no longer performed during
fermentation.

b) Induction Of Fermentation Cultures

Induction with IPTG (250 mg-10 gms, depending on the expression vector and
experiment) was initiated 1-3 hrs after initiation of the glucose feed (30-50 OD600/ml). The
growth rate after induction was monitored on an hourly basis. Aliquots (5-10 ml) of cells were
harvested at the time of induction, and at hourly intervals post-induction. Optical density
readings were determined by measuring the absorbance at 600 nm of 10 µl culture in 990 µl
PBS versus a PBS control. The growth rate after induction was found to vary depending on
the expression system utilized.
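
Because the absorbance is read on a 10 µl sample diluted into 990 µl PBS, the reading has
to be multiplied by the dilution factor to give the OD600 of the culture itself. A short
sketch of that correction is given below; the function name and the example reading are
illustrative and not taken from the original.

```python
def culture_od600(a600_diluted, sample_ul=10.0, diluent_ul=990.0):
    """OD600 of the undiluted culture from the absorbance of the diluted sample."""
    dilution_factor = (sample_ul + diluent_ul) / sample_ul  # 100-fold for 10 ul + 990 ul
    return a600_diluted * dilution_factor


# Hypothetical reading of 0.35 against the PBS blank.
print(round(culture_od600(0.35), 1))
```

With the volumes given in the text the dilution is 100-fold, so a blanked reading of 0.35
would correspond to a culture at about 35 OD600/ml, within the 30-50 OD600/ml range at
which induction was initiated.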

c) Monitoring Of Fermentation Cultures

Fermentation cultures were monitored using the following control assays.

i) Colony Forming Ability 

Aliquots of cells removed from the cultures at each timepoint sampled
(uninduced and at various times after induction) were serially diluted in PBS (dilution 1 = 15
µl cells/3 ml PBS, dilution 2 = 15 µl of dilution 1/3 ml PBS, dilution 3 = 3 or 6 µl of
dilution 2/3 mls PBS) and 100 µl of dilution 3 was plated on an LB or TSA (trypticase soy
agar) plate. The plates were incubated ON at 37°C and then the colonies were counted and
scored for macro or micro growth.
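
The three-step dilution scheme above can be turned into a viable count per ml of culture.
The sketch below assumes that "15 µl/3 ml PBS" means 15 µl added to 3 ml of PBS (so each
of the first two steps is a 201-fold dilution); the function names are hypothetical and the
colony count is used purely as an example.

```python
def dilution_factor(sample_ul, diluent_ul):
    """Fold dilution when sample_ul is added to diluent_ul."""
    return (sample_ul + diluent_ul) / sample_ul


def cfu_per_ml(colonies, dilution3_sample_ul, plated_ul=100.0):
    """Viable count of the undiluted culture from colonies on the dilution 3 plate."""
    total_dilution = (
        dilution_factor(15.0, 3000.0)                   # dilution 1
        * dilution_factor(15.0, 3000.0)                 # dilution 2
        * dilution_factor(dilution3_sample_ul, 3000.0)  # dilution 3 (3 or 6 ul)
    )
    return colonies * total_dilution / (plated_ul / 1000.0)


# Example plate: 21 colonies, dilution 3 prepared with 3 ul of dilution 2.
print(f"{cfu_per_ml(21, 3.0):.2e} CFU/ml")
```

On these assumptions a plate with 21 colonies corresponds to roughly 8.5 x 10^9 colony
forming units per ml of culture; using 6 µl for dilution 3 halves the overall dilution
(to within rounding) and therefore the calculated count per colony.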

ii) Phenotypic Characterization 

Colonies growing on LB or TSA plates (above) from uninduced and induced
timepoints were replica plated onto LB+kan, LB+chloramphenicol (for fermentations utilizing
LysS or pACYCGro plasmids), LB+kan+1 mM IPTG and LB plates, in this order. The plates
were grown 6-8 hrs at 37°C and growth was scored on each plate for a minimum of 40-50
well isolated colonies. The percentage of cells retaining the plasmid at time of induction (i.e.,
uninduced cultures immediately prior to the addition of IPTG) was determined to be the #
colonies LB+Kan (or chloramphenicol) plate/# colonies LB plate X 100%. The percentage of
cells with mutated pET plasmids was determined to be the # colonies LB+Kan+IPTG plate/#
colonies LB plate X 100%. Colonies on all LB plates were scored morphologically for E.
coli phenotype as a contamination control. Morphologically detectable contaminant colonies
were not detected in any fermentation.
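
The two percentages defined above are simple ratios against the count on the non-selective
LB plate. A minimal sketch follows; the function name is hypothetical, and the counts used
in the example are those reported for the time of induction in Example 33(a) (28 colonies
scored, 23 kanamycin resistant, 22 chloramphenicol resistant, none growing on
IPTG+Kan).

```python
def percent_of_lb(colonies_on_selective, colonies_on_lb):
    """Colonies on a selective replica plate as a percentage of colonies on the LB plate."""
    if colonies_on_lb <= 0:
        raise ValueError("at least one colony must be scored on the LB plate")
    return 100.0 * colonies_on_selective / colonies_on_lb


scored_on_lb = 28
expression_plasmid_retention = percent_of_lb(23, scored_on_lb)  # LB+kan
chaperone_plasmid_retention = percent_of_lb(22, scored_on_lb)   # LB+chloramphenicol
mutated_plasmids = percent_of_lb(0, scored_on_lb)               # LB+kan+IPTG

print(f"{expression_plasmid_retention:.1f}% / {chaperone_plasmid_retention:.1f}% / "
      f"{mutated_plasmids:.1f}%")
```

For those counts, roughly 82% of the cells retained the expression plasmid and roughly 79%
retained the chaperone plasmid at the time of induction, with no detectable mutants.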

iii) Recombinant BotA Protein Induction

A total of 10 OD600 units of cells (e.g., 200 µl of cells at OD600 = 50/ml) were removed
from each timepoint sample to a 1.5 ml microfuge tube and pelleted for 2 min at maximum
rpm in a microfuge. The pellets were resuspended in 1 ml of 50 mM NaHPO4, 0.5 M NaCl,
40 mM imidazole buffer (pH 6.8) containing 1 mg/ml lysozyme. The samples were incubated
for 20 min at room temperature and stored ON at -70°C. Samples were thawed completely at
room temperature and sonicated 2 X 10 seconds with a Branson Sonifier 450 microtip probe
at # 3 power setting. The samples were centrifuged for 5 min at maximum rpm in a
microfuge.

An aliquot (20 µl) of the protein samples was removed to 20 µl 2X sample buffer,
before or after centrifugation, for total and soluble protein extracts, respectively. The samples
were heated to 95°C for 5 min, then cooled and 5 or 10 µl were loaded onto 12.5%
SDS-PAGE gels. High molecular weight protein markers (BioRad) were also loaded to allow
for estimation of the MW of identified fusion proteins. After electrophoresis, protein was
detected either generally by staining gels with Coomassie blue, or specifically, by blotting
onto nitrocellulose (as described in Ex. 28) for Western blot detection of specific his-tagged
proteins utilizing a NiNTA-alkaline phosphatase conjugate exactly as described by the
manufacturer (Qiagen).

iv) Recombinant Antigen Purification

At the end of each fermentation run, 1-10 liters of culture were harvested from the
fermenter and the bacterial cells were pelleted by centrifugation at 6000 rpm for 10 min in a
JA10 rotor (Beckman). The cell pellets were stored frozen at -70°C or utilized immediately
without freezing. Cell pellets were resuspended to 15-20% weight to volume in resuspension
buffer (generally 50 mM NaPO4, 0.5 M NaCl, 40 mM imidazole, pH 6.8) and lysed utilizing
either sonication or high pressure homogenization.

For sonication, the resuspension buffer was supplemented with lysozyme to 1 mg/ml
and the suspension was incubated for 20 min at room temp. The sample was then frozen ON
at -70°C, thawed and sonicated 4 X 20 seconds at microtip maximum to reduce viscosity.

For homogenization, the cells were lysed by 2 passes through a homogenizer (Rannie
Mini-lab type 8.30 H) at 600 Bar. Cell lysates were clarified by centrifugation for 30 min at
10,000 rpm in a JA10 rotor.

For IDA chromatography, samples were flocculated utilizing polyethyleneimine (PEI)
prior to centrifugation. Cell pellets were resuspended in cell resuspension buffer (CRB: 50
mM NaPO4, 0.5 M NaCl, 40 mM imidazole, pH 6.8) to create a 20% cell suspension (wet
weight of cells/volume of CRB) and cell lysates were prepared as described above (sonication
or homogenization). PEI (a 2% solution in dH2O, pH 7.5 with HCl) was added to the cell
lysate to a final concentration of 0.2%, and stirred for 20 min at room temperature prior to
centrifugation (8,500 rpm in JA10 rotor for 30 minutes at 4°C). This treatment removed
RNA, DNA and cell wall components, resulting in a clarified, low viscosity lysate ("PEI
clarified lysate").

His-tagged proteins were purified from soluble lysates by metal-chelate affinity
chromatography using either a NiNTA resin (as described in Ex. 28) or an IDA (iminodiacetic
acid) resin as described below.

IDA resin affinity purifications were performed utilizing a low pressure
chromatography system (ISCO). A 7 ml (small scale) or 70 ml (large scale) Chelating
Sepharose Fast Flow (Pharmacia) affinity column was poured; in addition, a second guard
column was poured and attached in line with the first column (to capture Ni ions that leached
off the affinity column). The columns were washed with 3 column volumes of dH2O. The
guard column was then removed and the affinity column was washed with 0.3 M NiSO4 until
resistivity was established, then with dH2O until the resistivity returned to baseline. The
columns were reconnected and equilibrated with cell resuspension buffer (CRB: 50 mM
NaPO4, 0.5 M NaCl, 40 mM imidazole, pH 6.8). The clarified sample (in CRB) was loaded.
Flow rates were 5 ml/min for small scale columns and 20 ml/min for large scale columns.
After sample loading, the column was washed with CRB until a baseline was established and
bound protein was eluted with elution buffer (50 mM NaPO4, 0.5 M NaCl, 800 mM
imidazole, 20% glycerol, pH 6.8 or 8.0). Protein samples were stored at 4°C or -20°C. The
yield of eluted protein was established by measuring the OD280 of the elutions, with a 1 mg/ml
solution of protein assumed to yield an absorbance reading of 2.0.
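
The yield estimate described above follows directly from the stated assumption that a
1 mg/ml protein solution reads 2.0 at 280 nm. A brief sketch is given below; the function
name and the example reading and volume are hypothetical and serve only to show the
arithmetic.

```python
def eluted_protein_mg(od280, elution_volume_ml, od280_per_mg_per_ml=2.0):
    """Protein yield (mg), assuming a 1 mg/ml solution gives an OD280 of 2.0."""
    concentration_mg_per_ml = od280 / od280_per_mg_per_ml
    return concentration_mg_per_ml * elution_volume_ml


# Hypothetical elution: OD280 of 1.3 measured on a 60 ml pool.
print(round(eluted_protein_mg(1.3, 60.0), 1), "mg")
```

Readings taken on a diluted sample would need to be multiplied by the dilution factor
before being passed to this calculation.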

The IDA columns may be regenerated and reused multiple times (>10). To regenerate
the column, the column was washed with 2-3 column volumes of H2O, then 0.05 M EDTA
until all of the blue/green color was removed, followed by a wash with dH2O. The IDA
columns were sterilized with 0.1 M NaOH (using at least 3 column volumes but not more
than 50 minutes contact time with column packing material), then washed with 3 column
volumes 0.05 M NaPO4, pH 5.0, then dH2O and stored at room temperature in 20% ethanol.

EXAMPLE 32

Construction Of A Folding Chaperone Overexpression System

Co-overexpression of the E. coli GroEL/GroES folding chaperones in a cell expressing
a recombinant foreign protein has been reported to enhance the solubility of some foreign
proteins that are otherwise insoluble when expressed in E. coli [Gragerov et al. (1992) Proc.
Natl. Acad. Sci. USA 89:10344]. The improvement in solubility is thought to be due to
chaperone-mediated binding and unfolding of insoluble denatured proteins, thus allowing
multiple attempts for productive refolding of recombinant proteins. By overexpressing the
chaperones, the unfolding/refolding reaction is driven by excess chaperone, resulting, in some
cases, in higher yields of soluble protein.

In this example, a chaperone overexpression system, compatible with pET vector
expression systems, was constructed to facilitate testing chaperone-mediated solubilization of
C. botulinum type A proteins. This example involved the cloning of the GroEL/ES operon
and construction of a pLysS-based chaperone hyperexpression system.

The GroEL/GroES operon was PCR amplified and cloned into the pCRScript vector as
described in Example 28. The following primer pair was used:
5'-GGCATATGAATATTCGTCCATTGCATG-3' (SEQ ID NO:37) (5' primer, start codon of groES
gene converted to NdeI site (underlined)) and 5'-GGAAGCTTGCAGGGCAATTACATCATG
(SEQ ID NO:38) (3' primer, stop codon of groEL gene italicized, engineered HindIII site
underlined). Following amplification, the chaperone operon was excised as an NdeI/HindIII
fragment and cloned into pET23b digested with NdeI and HindIII. This construction places
the Gro operon under the control of the T7 promoter of the pET23 vector. The desired
construct was confirmed by restriction digestion.

The T7 promoter-Gro operon-T7 terminator expression cassette was then excised as a
BglII/BspEI (filled) fragment and cloned into BamHI (compatible with BglII)/HindIII (filled)
cleaved pLysS plasmid (this removed the T7 lysozyme gene). The resulting construct was
designated pACYCGro, since the plasmid utilizes the pACYC184 origin from the pLysS
plasmid. Proper construction was confirmed by restriction digestion.

pACYCGro was transformed into BL21(DE3), and cultures were grown and induced with
1 mM IPTG as described in preceding examples. Total and soluble protein extracts were
generated from cells removed before and after IPTG induction and were resolved on a 12.5%
SDS-PAGE gel and stained with Coomassie blue. This analysis revealed that high levels of
soluble GroEL and GroES proteins were made in the induced cells. These results
demonstrated that the chaperone hyper-expression system was functional.

EXAMPLE 33 

Growth Of BotA/pACYCGro Cell Lines In Fermentation Cultures

Induction of BL21(DE3) cells lacking the LysS plasmid which contained BotA
expression constructs grown in shaker flask or fermentation culture resulted in the expression
of primarily insoluble BotA protein. Fermentation cultures were performed to determine if
the simultaneous overexpression of the Gro operon and recombinant C. botulinum type A
proteins (BotA proteins) resulted in enhanced solubility of the recombinant BotA protein.
This example involved the fermentation of pHisBotA(syn)kan lacIq T7lac/pACYCGro
BL21(DE3) and pHisBotA(syn)kan lacIq T7/pACYCGro BL21(DE3) cell lines. The
fermentations were repeated exactly as described in Example 31. Chloramphenicol (34
µg/ml) was included in the feeder and fermentation cultures.

a) Fermentation Of pHisBotA(syn)kan lacIq T7lac/pACYCGro
BL21(DE3) Cells

For fermentation of cells containing plasmids comprising the T7lac promoter,
induction was with 2 gms IPTG at 1 hr post initiation of glucose feed. The OD600 was 35 at
time of induction, then 48.5, 61.5, 67 at 1-3 hrs post induction. Viable colony counts
decreased from 0-3 hr induction [21 (13), 0, 0, 0; dilution 3 utilized 3 µl of dilution 2 cells]
with numbers in parenthesis for the uninduced timepoint indicating microcolonies. Of 28
colonies scored at the time of induction, 23 retained the pHisBotA(syn)kan lacIq T7lac plasmid
(kan resistant), 22 contained the chaperone plasmid (chloramphenicol resistant) and no
colonies at induction grew on IPTG+Kan plates (no mutations detected). These results were
indicative of very strong promoter induction, since colony viability dropped immediately
after induction.

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and stained with
Coomassie. High level induction of Gro chaperones was observed, but very low level
expression of soluble BotA protein was observed, increasing from 1 to 4.0 hrs post induction
(no expression detected in uninduced cells). The dramatically lower expression of the BotA
antigen in the presence of chaperone may be due to promoter occlusion (i.e., the stronger T7
promoter on the chaperone plasmid is preferentially utilized).

b) Fermentation Of pHisBotA(syn)kan lacIq T7/pACYCGro
BL21(DE3) Cells

A fermentation utilizing the T7-driven BotA expression plasmid was performed.
Induction was with 1 gm IPTG at 2 hrs post initiation of glucose feed. The OD600 was 41 at
time of induction, then 51.5, 61.5, 61.5 and 66 at 1-4 hrs post induction. Viable colony
counts decreased from 0-4 hrs induction [71, 1 (34), 1 (1), 1, 0; dilution 3 utilized 6 µl of
dilution 2 cells] with numbers in parenthesis for the uninduced timepoint indicating
microcolonies. Of 65 colonies scored at the time of induction, all 65 retained both the
pHisBotA(syn)kan lacIq T7 plasmid (kan resistant) and the chaperone plasmid
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no
mutations detected).

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and stained with
Coomassie. High level induction of Gro chaperones and moderate level expression of soluble
BotA protein was observed, increasing from 1 to 4.0 hrs post induction (no expression
detected in uninduced cells).

A PEI-clarified lysate (0.2% final concentration PEI) [850 ml from gm cell pellet
(2 liters fermentation harvest)] was purified on a large scale IDA column. A total of 78 mg
of protein was eluted. Extracts from the purification were resolved on a 12.5% SDS-PAGE
gel and stained with Coomassie. The elution was found to contain an approximately 1:1 mix
of BotA/chaperone protein (Figure 32). PEI lysates prepared in this manner were typically 16
OD280/ml. This was estimated to be 8 mg protein/ml of lysate (by BCA assay). Thus, the
eluted recombinant BotA protein represented 0.55% of the total soluble cellular protein
applied to the column.
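
The recovery figures in this paragraph can be reproduced approximately from the numbers
given. The sketch below is a rough worked calculation, not the applicants' own: it treats
the "approximately 1:1" BotA/chaperone ratio estimated from the gel as exactly one half of
the eluted protein, and all variable names are illustrative.

```python
lysate_volume_ml = 850.0           # PEI-clarified lysate applied to the column
lysate_protein_mg_per_ml = 8.0     # BCA estimate for the lysate
eluted_protein_mg = 78.0           # total protein eluted from the IDA column
bota_share_of_eluate = 0.5         # assumption: "approximately 1:1" taken as exactly half
harvest_volume_l = 2.0             # fermentation harvest that produced the pellet

protein_loaded_mg = lysate_volume_ml * lysate_protein_mg_per_ml
bota_mg = eluted_protein_mg * bota_share_of_eluate
bota_percent_of_loaded = 100.0 * bota_mg / protein_loaded_mg
bota_mg_per_liter = bota_mg / harvest_volume_l

print(f"protein loaded: {protein_loaded_mg:.0f} mg")
print(f"BotA recovered: {bota_mg:.1f} mg ({bota_percent_of_loaded:.2f}% of loaded protein)")
print(f"BotA per liter of culture: {bota_mg_per_liter:.1f} mg")
```

With these inputs the recovered BotA is a little under 0.6% of the protein applied, close
to the 0.55% stated above, and about 19.5 mg per liter of culture, in line with the
20 mg/liter cited in the conclusion of this example. The small differences are within what
would be expected from estimating the BotA/chaperone ratio by eye from a stained gel.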

In Figure 32, lane 1 contains molecular weight markers; lanes 2-9 contain extracts
from pHisBotA(syn)kan lacIq T7/pACYCGro/BL21(DE3) cells before or during purification
on the IDA column. Lane 2 contains total protein extract; lane 3 contains soluble protein
extract; lanes 4 and 5 contain PEI-clarified lysates (duplicates); lanes 6 and 7 contain
flow-through from the IDA column (duplicates) and lanes 8 and 9 contain IDA column eluate
(lane 9 contains 1/10 the amount applied to lane 8).

These results demonstrate that, although the majority of the BotA protein produced
was insoluble, 20 mg/liter of soluble recombinant BotA protein can be purified utilizing the
pHisBotA(syn)kan lacIq T7/pACYCGro/BL21(DE3) expression system.

EXAMPLE 34

Purification Of Recombinant BotA Protein From Folding Chaperones 

In this example, size exclusion chromatography was used to purify the recombinant
BotA protein away from the folding chaperones and imidazole present in the IDA-purified
material (Ex. 33).

To enhance the solubility of the recombinant BotA protein during scale-up, the protein
was co-expressed with folding chaperones (Ex. 33). As observed with the recombinant BotB
protein (Example 40 below), the folding chaperones co-eluted with the recombinant BotA
protein during the Ni-IDA purification step. Because the recombinant BotA and BotB
proteins have similar molecular weights (about 1/10 the size of the non-reduced folding
chaperone) and the imidazole step gradient strategy was unsuccessful in purifying BotB away
from the folding chaperone (see Ex. 40), size exclusion chromatography was examined for the
ability to purify the recombinant BotA protein away from the folding chaperones.

A column (2.5 x 24 cm) containing Sephacryl S-100 HR (Pharmacia) was poured (bed
volume 110 ml). Proteins having molecular weights greater than 100 K are expected to elute
in the void volume under these conditions and smaller proteins should be retained by the
beads and elute at different times, depending on their molecular weights. To maintain
solubility of the purified BotA protein, the Sephacryl column was equilibrated in a buffer
having the same salt concentration as the buffer used to elute the BotA protein from the IDA
column (i.e., 50 mM sodium phosphate, 0.5 M NaCl, 10% glycerol; all reagents from
Mallinckrodt, Chesterfield, MO).

Five milliliters of the IDA-purified recombinant BotA protein (Ex. 33) was filtered
through a 0.45 µ syringe filter, applied to the column and the equilibration buffer was
pumped through the column at a flow rate of 1 ml/minute. Eluted proteins were monitored
by absorbance at 280 nm and collected either manually or with a fraction collector (BioRad).
Appropriate fractions were pooled, if necessary, and the protein was quantitated by absorbance
at 280 nm and/or BCA protein assay (Pierce). The isolated peaks were then analyzed by
native and/or SDS-PAGE to identify the proteins present and to evaluate purity. The folding
chaperone eluted first, followed by the recombinant BotA protein and then the imidazole
peak.

SDS-PAGE analysis (12.5% polyacrylamide, reduced samples) was used to evaluate
the purity of the IDA-purified recombinant BotA protein before and after S-100 purification.
Figure 33 shows the difference in purity before and after the S-100 purification step. In
Figure 33, lane 1 contains molecular weight markers (BioRad broad range). Lane 2 shows
the IDA-purified recombinant BotA protein preparation, which is contaminated with
significant amounts of the folding chaperone. Following S-100 purification, the amount of
folding chaperone present in the BotA sample is reduced dramatically (lane 3). Lane 4
contains no protein (i.e., it is a blank lane); lanes 5-8 contain samples of IDA-purified
recombinant BotB and BotE proteins and are discussed infra.

Endotoxin levels in the S-100 purified BotA preparation were determined using the
LAL assay (Associates of Cape Cod) as described in Example 24. The purified BotA
preparation was found to contain 22.7 to 45.5 EU/mg recombinant protein.

These results demonstrate that size exclusion chromatography was successful in
purifying the recombinant BotA protein from folding chaperones and imidazole following an
initial IDA purification step. Furthermore, these results demonstrate that the S-100 purified
BotA protein was substantially free of endotoxin.

EXAMPLE 35 

Cloning And Expression Of The C Fragment
Of The C. botulinum Serotype B Toxin Gene

The C. botulinum type B neurotoxin gene has been cloned and sequenced [Whelan et
al. (1992) Appl. Environ. Microbiol. 58:2345 and Hutson et al. (1994) Curr. Microbiol.
28:101]. The nucleotide sequence of the toxin gene derived from the Eklund 17B strain
(ATCC 25765) is available from the EMBL/GenBank sequence data banks under the
accession number X71343; the nucleotide sequence of the coding region is listed in SEQ ID
NO:39. The amino acid sequence of the C. botulinum type B neurotoxin derived from the
strain Eklund 17B is listed in SEQ ID NO:40. The nucleotide sequence of the C. botulinum
serotype B toxin gene derived from the Danish strain is listed in SEQ ID NO:41 and the
corresponding amino acid sequence is listed in SEQ ID NO:42.

The DNA sequence encoding the native C. botulinum serotype B C fragment gene
derived from the Eklund 17B strain can be expressed using the pETHisb vector; the resulting
coding region is listed in SEQ ID NO:43 and the corresponding amino acid sequence is listed
in SEQ ID NO:44. The DNA sequence encoding the native C. botulinum serotype B C
fragment gene derived from the Danish strain can be expressed using the pETHisb vector; the
resulting coding region is listed in SEQ ID NO:45 and the corresponding amino acid sequence
is listed in SEQ ID NO:46. The C fragment region from any strain of C. botulinum serotype
B can be amplified and expressed using the approach illustrated below using the C fragment
derived from C. botulinum type B 2017 strain.

The C. botulinum type B neurotoxin gene is synthesized as a single polypeptide chain
which is processed to form a dimer composed of a light and a heavy chain linked via
disulfide bonds; the type B neurotoxin has been reported to exist as a mixture of predominantly
single chain with some double chain (Whelan et al., supra). The 50 kD carboxy-terminal
portion of the heavy chain is referred to as the C fragment or the HC domain. Expression of
the C fragment of C. botulinum type B toxin in heterologous hosts (e.g., E. coli) has not been
previously reported.

The native C fragment of the C. botulinum serotype B toxin gene was cloned and
expression constructs were made to facilitate protein expression in E. coli. This example
involved PCR amplification of the gene, cloning, and construction of expression vectors.

The C fragment of the C. botulinum serotype B (BotB) toxin gene was cloned using
the protocols and conditions described in Example 28 for the isolation of the native BotA
gene. The C. botulinum type B 2017 strain was obtained from the American Type Culture
Collection (ATCC #17843). The following primer pair was used to amplify the BotB gene:
5'-CGCCATGGCTGATACAATACTAATAGAAATG-3' [5' primer, engineered NcoI site
underlined (SEQ ID NO:47)] and 5'-GCAAGCTTTTATTCAGTCCACCCTTCATC-3' [3'
primer, engineered HindIII site underlined, native gene termination codon italicized (SEQ ID
NO:48)]. After cloning into the pCRscript vector, the NcoI(filled)/HindIII fragment was
cloned into pETHisb vector as described for BotA C fragment gene in Example 28. The
resulting construct was termed pHisBotB.
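
The design features attributed to the two primers (an engineered NcoI site in the 5'
primer, and an engineered HindIII site followed by the reverse complement of the native
termination codon in the 3' primer) can be checked with a few lines of code. The sketch
below is illustrative only: the primer strings are the sequences as best read from this
text, and the Sequence Listing (SEQ ID NOS:47 and 48) remains authoritative. The helper
names are hypothetical; the NcoI (CCATGG) and HindIII (AAGCTT) recognition sequences are
the standard ones.

```python
# Primer sequences as read from the text (assumption: transcription is accurate).
FORWARD_PRIMER = "CGCCATGGCTGATACAATACTAATAGAAATG"  # SEQ ID NO:47
REVERSE_PRIMER = "GCAAGCTTTTATTCAGTCCACCCTTCATC"    # SEQ ID NO:48

RECOGNITION_SITES = {"NcoI": "CCATGG", "HindIII": "AAGCTT"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def site_position(primer, enzyme):
    """0-based index of the enzyme's recognition site in the primer, or -1 if absent."""
    return primer.find(RECOGNITION_SITES[enzyme])


nco_start = site_position(FORWARD_PRIMER, "NcoI")
hind_start = site_position(REVERSE_PRIMER, "HindIII")

# Everything 3' of the HindIII site anneals to the end of the gene; read on the coding
# strand, it should finish with a termination codon.
gene_end_coding_strand = reverse_complement(REVERSE_PRIMER[hind_start + 6:])

print("NcoI site at index", nco_start)
print("HindIII site at index", hind_start)
print("coding strand at 3' end of gene:", gene_end_coding_strand)
print("ends in stop codon:", gene_end_coding_strand[-3:] in STOP_CODONS)
```

The ATG contained within the NcoI site supplies the initiation codon of the amplified
fragment, which is why this enzyme is commonly chosen for the 5' end of an expression
insert.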

pHisBotB expresses the BotB gene sequences under the transcriptional control of the
T7 lac promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag. The
pHisBotB expression construct was transformed into BL21(DE3) pLysS competent cells and 1
liter cultures were grown, induced and his-tagged proteins were purified utilizing a NiNTA
resin (eluted in low pH elution buffer) as described in Example 28. Total, soluble and
purified proteins were resolved by SDS-PAGE and detected by Coomassie staining and
Western blot hybridization utilizing a chicken anti-C. botulinum serotype B toxoid primary
antibody (generated by immunization of hens using C. botulinum serotype B toxoid as
described in Example 3). Samples of BotA and BotE C fragment proteins were included on
the gels for MW and immunogenicity comparisons. Strong immunoreactivity to only the
BotB protein was detected with the anti-C. botulinum serotype B toxoid antibodies. The
recombinant BotB protein was expressed at low levels (3 mg/liter) as a soluble protein. The
purified BotB protein migrated as a single band of the predicted MW (i.e., ~50 kD).

These results demonstrate the cloning of the native C. botulinum serotype B C
fragment gene, and the expression and purification of the recombinant BotB protein as a
soluble his-tagged protein in E. coli.

EXAMPLE 36 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBotB Protein

The ability of the purified pHisBotB protein to generate neutralizing antibodies was
examined. Nine BALBc mice were immunized with BotB protein (purified as described in
Ex. 35) using Gerbu GMDP adjuvant (CC Biotech). The low pH elution was mixed with
Gerbu adjuvant and used to immunize mice. Each mouse received a subcutaneous injection
of 100 µl antigen/adjuvant mix (15 µg antigen + 1 µg adjuvant) on day 0. Mice were
subcutaneously boosted as above on day 14 and bled on day 28. Mice were subsequently
boosted 1-2 weeks after bleeding and were then bled on day 70.

Anti-C. botulinum serotype B toxoid titers were determined in day 28 serum from
individual mice from each group using the ELISA protocol outlined in Example 29 with the
exception that the plates were coated with C. botulinum serotype B toxoid, and the primary
antibody was a chicken anti-C. botulinum serotype B toxoid. Seroconversion [relative to
control mice immunized with pHisBotE antigen (described below)] was observed with all 9
mice immunized with the purified pHisBotB protein.

The ability of the anti-BotB antibodies to neutralize native C. botulinum type B toxin
was tested in a mouse-C. botulinum neutralization model using pooled mouse serum (see Ex.
23b). The LD50 of purified C. botulinum type B toxin complex (Dr. Eric Johnson, University
of Wisconsin, Madison) was determined by an intraperitoneal (IP) method [Schantz and Kautter
(1978), supra] using 18-22 g female ICR mice. The amount of neutralizing antibodies present
in the serum of the immunized mice was determined using serum antibody titrations. The
various serum dilutions (0.01 ml) were mixed with 5 LD50 units of C. botulinum type B toxin
and the mixtures were injected IP into mice. The neutralizations were performed in duplicate.
The mice were then observed for signs of botulism for 4 days. Undiluted serum (day 28 or
day 70) was found to protect 100% of the injected mice while the 1:10 diluted serum did not.
This corresponds to a neutralization titer of 0.05-0.5 IU/ml.
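
The titer range quoted above can be derived from the challenge dose and the serum volume.
The sketch below assumes the conventional definition in which 1 IU of type B antitoxin
neutralizes 10,000 mouse IP LD50 of toxin; that conversion factor is not stated in this
passage and is labeled as an assumption in the code. The function name is hypothetical.

```python
LD50_PER_IU = 10_000  # assumption: 1 IU of antitoxin neutralizes 10,000 mouse IP LD50


def titer_bounds_iu_per_ml(toxin_ld50, serum_ml, protective_fold, failing_fold):
    """Lower and upper bounds (IU/ml) on the titer of the undiluted serum.

    protective_fold: highest fold dilution that still protected (1 = undiluted)
    failing_fold:    lowest fold dilution that failed to protect
    """
    iu_needed = toxin_ld50 / LD50_PER_IU    # IU required to neutralize the challenge dose
    iu_per_ml_needed = iu_needed / serum_ml  # concentration required in the tested volume
    return iu_per_ml_needed * protective_fold, iu_per_ml_needed * failing_fold


# 5 LD50 mixed with 0.01 ml serum; undiluted serum protected, 1:10 dilution did not.
lower, upper = titer_bounds_iu_per_ml(5, 0.01, protective_fold=1, failing_fold=10)
print(f"titer between {lower:.2f} and {upper:.2f} IU/ml")
```

Protection by 0.01 ml of undiluted serum implies at least 0.05 IU/ml, while failure of
the 1:10 dilution implies less than 0.5 IU/ml, which reproduces the stated range under
this assumption.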

These results demonstrate that seroconversion occurred and neutralizing antibodies
were induced when the pHisBotB protein was utilized as the immunogen.

EXAMPLE 37 

Construction Of Vectors To Facilitate Expression
Of His-Tagged BotB Protein In Fermentation Cultures

A number of expression vectors were constructed to facilitate the expression of
recombinant BotB protein in large scale fermentation culture. These constructs varied as to
the strength of the promoter utilized (T7 or T7lac) and the presence of repressor elements
(lacIq) on the plasmid. The resulting constructs varied in the level of expression achieved and
in plasmid stability, which facilitated the selection of an optimal expression system for
fermentation scale-up.

The BotB expression vectors created for fermentation culture were engineered to
utilize the kanamycin rather than the ampicillin resistance gene, and contained either the T7 or
T7lac promoter, with or without the lacIq gene for the reasons outlined in Example 30.

In all cases, the protein expressed by the various expression vectors is the pHisBotB
protein described in Example 35, with the only differences between clones being the alteration
of various regulatory elements. Using the designations outlined below, the pHisBotB clone
(Ex. 35) is equivalent to pHisBotB amp T7lac.

a) Construction Of pHisBotB kan T7lac

pHisBotB kan T7lac was constructed by insertion of the BglII/HindIII fragment of
pHisBotB which contains the BotB gene sequences into the pPA1870-2680 kan T7lac vector
which had been digested with BglII and HindIII (the pPA1870-2680 kan T7lac vector contains
the pET24 kan gene in the pET23 vector, such that no lacIq gene is present). Proper
construction of pHisBotB kan T7lac was confirmed by restriction digestion.

b) Construction Of pHisBotB kan lacIq T7lac

pHisBotB kan lacIq T7lac was constructed by insertion of the BglII/HindIII fragment
of pHisBotB which contains the BotB gene sequences into similarly cut pET24a vector.
Proper construction of pHisBotB kan lacIq T7lac was confirmed by restriction digestion.

c) Construction Of pHisBotB kan lacIq T7

pHisBotB kan lacIq T7 was constructed by inserting the NdeI/XhoI fragment from
pHisBotB kan lacIq T7lac which contains the BotB gene sequences into similarly cleaved
pPA1870-2680 kan lacIq T7 vector (this vector contains the T7 promoter, the same
N-terminal his-tag as the Bot constructs, the C. difficile toxin A insert, and the kan lacIq genes;
this cloning replaces the C. difficile toxin A insert with the BotB insert). Proper construction
was confirmed by restriction digestion.

Expression of recombinant BotB protein from these expression vectors and purification
of the BotB protein is described in Example 38 below.

EXAMPLE 38 

Fermentation And Purification Of Recombinant BotB Protein Utilizing The
pHisBotB kan lacIq T7lac, pHisBotB kan T7lac And pHisBotB kan lacIq T7 Vectors

The pHisBotB kan lacIq T7lac, pHisBotB kan T7lac and BotB kan lacIq T7 constructs
[all transformed into the BL21(DE3) strain] were grown in fermentation cultures to determine
the utility of the various constructs for large scale expression and purification of soluble BotB
protein. All fermentations were performed as described in Example 31.

a) Fermentation Of pHisBotB kan lacIq T7lac/BL21(DE3) Cells

The fermentation culture was induced 45 min post start of glucose feed with 1 gm
IPTG (final concentration = 0.4 mM). pH was maintained at 6.5 rather than 7.0. The OD600
was 27 at time of induction, then 35, 38, and 40 at 1-3 hrs post induction. Duplicate platings
of diluted 1 hr induction samples (dilutions were prepared as described in Ex. 31; dilution 3
utilized 3 µl of dilution 2 cells) on TSA and LB+kan plates yielded 8*) TSA colonies and 81
kan colonies (90% kan resistant).
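
The IPTG masses and final concentrations quoted for the fermentations in this example imply a culture volume, which is not stated here. The following sketch back-calculates it. The molar mass of IPTG (238.30 g/mol) is a standard reference value that does not appear in the text, and the function name is hypothetical.

```python
# Molar mass of IPTG (C9H18O5S); a reference value, not given in the text.
IPTG_G_PER_MOL = 238.30


def implied_volume_liters(iptg_grams: float, final_mm: float) -> float:
    """Culture volume (L) implied by an IPTG mass and its stated final concentration (mM)."""
    millimoles = iptg_grams / IPTG_G_PER_MOL * 1000.0
    return millimoles / final_mm


# (gm IPTG, stated final mM) for fermentations a), b) and c) of this example
for grams, mm in [(1.0, 0.4), (2.0, 0.8), (4.0, 1.6)]:
    print(f"{grams:g} gm at {mm} mM -> {implied_volume_liters(grams, mm):.1f} L")
```

All three inductions imply the same working volume of roughly 10.5 liters, so the stated concentrations are mutually consistent.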

Total and soluble protein extracts were resolved on a 12.5% SDS-PAGE gel and total
protein was detected by staining with Coomassie blue. Low level induction of insoluble BotB
protein was observed, increasing from 1 to 3 hrs post induction (no expression was detected
in uninduced cells).

b) Fermentation Of pHisBotB kan T7lac/BL21(DE3) Cells

The fermentation culture was induced 1 hr post start of glucose feed with 2 gm IPTG
(final concentration = 0.8 mM). pH was maintained at 6.5 rather than 7.0. The OD600 was
24.5 at time of induction, then 31.5, 32, and 33 at 1-3 hrs post induction, respectively.
Duplicate platings of diluted 0 hr and 2 hr induction samples (dilutions were prepared as
described in Ex. 31; dilution 3 utilized 3 µl of dilution 2 cells) on TSA and LB+kan plates
yielded 32 TSA colonies and 54 kan colonies (all kan resistant) for uninduced cells, and 1
TSA colony and 0 kan colonies 2 hr post induction. These results were indicative of strong
induction, since viable counts decreased dramatically 2 hrs post induction.

Total and soluble extracts were resolved on a 10% SDS-PAGE gel and total protein
was detected by staining with Coomassie blue. Moderate induction of insoluble BotB protein
was observed, increasing from 1 to 3 hrs post induction (no expression was detected in
uninduced cells).

c) Fermentation Of pHisBotB kan lacIq T7/BL21(DE3) Cells

The fermentation was induced 2 hr post start of glucose feed with 4 gm IPTG (final
concentration = 1.6 mM). pH was maintained at 6.5 rather than 7.0. The OD600 was 45 at
time of induction, then 47, 50, 50 and 55 at 1-4 hrs post induction, respectively. Viable
colony counts decreased after induction (%, 1, 1, 2, 3; dilution 3 utilized 3 µl of dilution 2
cells). Of 63 colonies scored at the time of induction, all 63 retained the BotB plasmid (kan
resistant) and no colonies at induction grew on IPTG + Kan plates (no mutations detected).

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and total protein
was detected by staining with Coomassie blue. Moderate level induction of insoluble BotB
protein was observed, increasing from 1 to 4 hrs post induction (lower level expression was
detected in uninduced cells, since the T7 rather than T7lac promoter was utilized).

d) Purification Of pHisBotB Protein From pHisBotB amp
T7lac/BL21(DE3) Cells

Soluble recombinant BotB protein was purified utilizing NiNTA resin from HO ml of
cell lysate generated from cells harvested from a pHisBotB fermentation [using the pHisBotB
amp T7lac/BL21(DE3) strain]. As predicted from the small scale results above, the majority
of the induced protein was insoluble. As well, the eluted material was contaminated with
multiple E. coli contaminant proteins. A Coomassie blue-stained SDS-PAGE gel containing
extracts derived from pHisBotB amp T7lac/BL21(DE3) cells before and during purification is
shown in Figure 34. In Figure 34, lane 1 contains broad range protein MW markers
(BioRad). Lanes 2-5 contain extracts prepared from pHisBotB amp T7lac/BL21(DE3) cells
grown in fermentation culture: lane 2 contains total protein; lane 3 contains soluble protein;
lane 4 contains protein which did not bind to the NiNTA column (i.e., the flow-through) and
lane 5 contains protein eluted from the NiNTA column.

Similar results were obtained using a small scale IDA column utilizing a cell lysate
from the pHisBotB kan lacIq T7 fermentation described above. 250 mls of a 20% w/v PEI
clarified lysate (50 gms cell pellet) of BotB kan lacIq T7/BL21(DE3) cells were purified on a
small scale IDA column. The total yield of eluted protein was 21 mg protein (assuming a 1
mg/ml solution = 2 OD280/ml). When analyzed by SDS-PAGE and Coomassie staining, the
BotB protein was found to comprise approximately 50% of the eluted protein with the
remainder being a ladder of E. coli proteins similar to that observed with the NiNTA
purification.

The NiNTA alkaline phosphatase conjugate was utilized to detect his-tagged proteins
on a Western blot containing total, soluble, soluble (PEI clarified), soluble (after IDA column)
and elution samples from the IDA column purification. The results demonstrated that a small
percentage of BotB protein was soluble, that the soluble protein was not precipitated by PEI
treatment and was quantitatively bound by the IDA column. Since a 1 liter fermentation
harvest yielded a 67.5 gm cell pellet, this indicated that the yield of soluble affinity purified
BotB protein from the IDA column was 14 mg/liter.
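
The per-liter yield follows from the eluted protein mass, the BotB fraction of the eluate, the cell mass processed and the cell pellet obtained per liter of fermentation. A minimal sketch of that arithmetic is given below; the function and parameter names are hypothetical, and the inputs are the figures reported for the small scale IDA purification.

```python
def yield_mg_per_liter(eluted_mg, botb_fraction, pellet_processed_g, pellet_per_liter_g):
    """Soluble BotB yield per liter of fermentation.

    eluted_mg           total protein eluted from the column
    botb_fraction       share of the eluate that is BotB (by stained gel)
    pellet_processed_g  wet cell mass that went into the lysate
    pellet_per_liter_g  wet cell mass harvested per liter of fermentation
    """
    botb_mg_per_g_cells = eluted_mg * botb_fraction / pellet_processed_g
    return botb_mg_per_g_cells * pellet_per_liter_g


# 21 mg eluted, about 50% BotB, from 50 gm of cells; 67.5 gm cells per liter.
print(f"{yield_mg_per_liter(21, 0.5, 50, 67.5):.1f} mg/liter")
```

The result, about 14.2 mg/liter, matches the 14 mg/liter figure given in the text.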

EXAMPLE 39 

Co-Expression Of Recombinant BotB Proteins
And Folding Chaperones In Fermentation Cultures

Fermentations were performed to determine if the simultaneous overexpression of
folding chaperones (i.e., the Gro operon) and the BotB protein resulted in enhanced solubility
of the BotB protein. This example involved fermentation of the pHisBotB kan lacIq
T7lac/pACYCGro BL21(DE3), pHisBotB kan T7lac/pACYCGro BL21(DE3) and pHisBotB kan
lacIq T7/pACYCGro BL21(DE3) cell lines. Fermentation was carried out as described in
Example 31; 34 µg/ml chloramphenicol was included in the feeder and fermentation cultures.

a) Fermentation Of pHisBotB kan lacIq T7lac/pACYCGro
BL21(DE3) Cells

Induction was with 4 gms IPTG at 1 hr 15 min post initiation of the glucose feed.
The OD600 was 38 at time of induction, then 50, 58.5, 62 and 68 at 1-4 hrs post induction.
Viable colony counts decreased during induction (24, 0, 0, 2, 0 at 0-4 hr induction; dilution 3
utilized 3 µl of dilution 2 cells). Of 24 colonies scored at the time of induction, 24 retained
the BotB plasmid (kan resistant), 24 contained the chaperone plasmid (chloramphenicol
resistant) and no colonies at induction grew on IPTG+kan plates (no mutations detected).

Total and soluble extracts were resolved on 12.5% SDS-PAGE gels and were either
stained with Coomassie blue or subjected to Western blotting (his-tagged proteins were
detected utilizing the NiNTA-alkaline phosphatase conjugate). This analysis revealed that the
Gro chaperones were induced to high levels, but very low level expression of soluble BotB
protein was observed, increasing from 1 to 4.0 hrs post induction (no expression detected in
uninduced cells, induced protein detected only on Western blot). The dramatically lower
expression of BotB protein in the presence of chaperone may be due to promoter occlusion
(i.e., the stronger T7 promoter on the chaperone plasmid was preferentially utilized).

b) Fermentation Of pHisBotB kan T7lac/pACYCGro/BL21(DE3)
Cells

Induction was with 4 gms IPTG at 1 hr post initiation of the glucose feed. The OD600
was 33.5 at time of induction, then 44, 51, 58.5 and 69 at 1-4 hrs post induction. Viable
colony counts decreased after 2 hrs induction (43, 65, 74, 0 (70), 0 (70) at 0-4 hr induction;
bracketed numbers represent microcolonies; dilution 3 utilized 3 µl of dilution 2 cells). Most
colonies at induction retained the BotB plasmid (kan resistant) and the chaperone plasmid
(chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates (no
mutations detected).

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and subjected to
Western blotting; his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase
conjugate. This analysis revealed that the Gro chaperones were induced to high levels and
low level expression of soluble BotB protein was observed, increasing from 1 to 4.0 hrs post
induction (no expression detected in uninduced cells).

A small scale IDA purification of BotB protein from a 250 ml PEI clarified 15% w/v
extract (37.5 gm cell pellet) yielded approximately 12.5 mg protein, of which approximately
50% was BotB protein and 50% was GroEL chaperone (assessed by Coomassie staining of a
10% SDS-PAGE gel). The NiNTA alkaline phosphatase conjugate was utilized to detect
his-tagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble
(after IDA column) and elution samples from the IDA column purification. The results
demonstrated that all of the BotB protein produced by the pHisBotB kan
T7lac/pACYCGro/BL21(DE3) cells was soluble; the BotB protein was not precipitated by PEI
treatment and was quantitatively bound by the IDA column. Since a 1 liter fermentation
harvest yielded a 75 gm cell pellet, this indicated that the yield of soluble affinity purified
BotB protein from this fermentation was 12.5 mg/liter. These results also demonstrated that
additional purification steps are necessary to separate the chaperone proteins from the BotB
protein.

c) Fermentation Of pHisBotB kan lacIq
T7/pACYCGro/BL21(DE3) Cells

Induction was with 4 gms IPTG at 2 hr post initiation of the glucose feed. The OD600
was 46 at time of induction, then 56, 63, 69 and 71.5 at 1-4 hrs post induction. Viable colony
counts decreased after induction (58, 3 (5), 3, 0, 0 at 0-4 hr induction; bracketed numbers
represent microcolonies; dilution 3 utilized 3 µl of dilution 2 cells). All (53/53) colonies
scored at the time of induction retained the BotB plasmid (kan resistant) and the chaperone
plasmid (chloramphenicol resistant) and no colonies at induction grew on IPTG+Kan plates
(no mutations detected).

Total and soluble extracts were resolved on 10% SDS-PAGE gels and Western
blotted, and his-tagged proteins were detected utilizing the NiNTA-alkaline phosphatase
conjugate. This analysis revealed that the Gro chaperones were induced to high levels
(observed by Ponceau S staining), and a much higher expression of soluble BotB protein
(compared to expression in the pHisBotB kan T7lac/pACYCGro fermentation) was observed
at all timepoints, including uninduced cells (some increase in BotB protein levels was
observed after induction).

A small scale IDA purification of BotB protein from a 100 ml PEI clarified 15% w/v
extract (15 gm cell pellet) yielded approximately 40 mg protein, of which approximately 50%
was BotB protein and 50% was GroEL chaperone, as assessed by Coomassie staining of a
10% SDS-PAGE gel. The NiNTA alkaline phosphatase conjugate was utilized to detect
his-tagged proteins on a Western blot containing total, soluble, soluble (PEI clarified), soluble
(after IDA column) and elution samples from the IDA column purification. The results
demonstrated that a significant percentage (i.e., ~10-20%) of BotB protein was soluble, that
the solubilized protein was not precipitated by PEI treatment and was quantitatively bound by
the IDA column. Since a 1 liter fermentation yielded a 108 gm cell pellet, this indicated that
the yield of soluble affinity purified BotB protein from this fermentation was 144 mg/liter.

In a scale up experiment, 2 liters of a 20% w/v PEI clarified lysate of pHisBotB kan
lacIq T7/pACYCGro/BL21(DE3) cells were purified on a large scale IDA column. The
purification was performed in duplicate. The total yield of BotB protein was 220 and 325
mgs protein in the two experiments (assuming a 1 mg/ml solution = 2.0 OD280/ml). This
represents 0.7% or 1.0%, respectively, of the total soluble cellular protein (assuming a PEI
lysate having a concentration of 8 mg protein/ml and that the eluted material comprises a 1:1
mixture of BotB and folding chaperone). The NiNTA alkaline phosphatase conjugate was
utilized to detect his-tagged proteins on a Western blot containing total, soluble, soluble (PEI
clarified), soluble (after IDA column) and elution samples from the IDA column purification.
These results demonstrated that a significant percentage (i.e., ~10-20%) of the BotB protein
was soluble, that the solubilized protein was not precipitated by PEI treatment and was
quantitatively bound by the IDA column. Since a 1 liter fermentation harvest yielded a 108
gm cell pellet, this indicated that the yield of soluble affinity purified BotB protein from the
large scale purification was 60 mg or 89 mg/liter. These results also demonstrated that
further purification would be necessary to remove the contaminating chaperone protein.
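
The 0.7% and 1.0% figures above can be checked from the stated assumptions (a PEI lysate at 8 mg protein/ml and an eluate that is a 1:1 mixture of BotB and chaperone). The following sketch is illustrative only; the function and parameter names are hypothetical.

```python
def percent_of_soluble_protein(eluted_mg, botb_fraction, lysate_ml, lysate_mg_per_ml):
    """BotB recovered, as a percentage of total soluble protein loaded on the column."""
    total_soluble_mg = lysate_ml * lysate_mg_per_ml
    return 100.0 * eluted_mg * botb_fraction / total_soluble_mg


# Two duplicate runs: 220 mg and 325 mg eluted from 2 liters of lysate at 8 mg/ml,
# with half of the eluted material taken to be BotB.
for eluted in (220, 325):
    pct = percent_of_soluble_protein(eluted, 0.5, 2000, 8)
    print(f"{eluted} mg eluted -> {pct:.1f}% of soluble protein")
```

With these inputs the two runs come out at approximately 0.7% and 1.0%, in agreement with the values reported.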

The above results provide methodologies for the purification of soluble BotB protein
from fermentation cultures, in a form contaminated predominantly with a single E. coli
protein (the folding chaperone utilized to enhance solubility). In the next example, methods
are provided for the removal of the contaminating chaperone protein.

EXAMPLE 40

Removal Of Contaminating Folding Chaperone Protein 
From Purified Recombinant C. botulinum Type B Protein

In this example size exclusion chromatography and ultrafiltration were used to purify
recombinant BotB protein from the folding chaperones and imidazole in IDA-purified
material.

To enhance the solubility of the recombinant BotB protein during scale-up, the protein
was co-expressed with folding chaperones (see Ex. 39). During the Ni-IDA purification step,
the folding chaperones co-eluted with the BotB protein in 800 mM imidazole; therefore, a
second purification step was required to isolate the BotB free of folding chaperones. Lane 3
of Figure 35 contains proteins eluted from an IDA column to which a lysate of pHisBotB kan
lacIq T7/pACYCGro/BL21(DE3) cells had been applied; the proteins were resolved on a
4-15% polyacrylamide pre-cast gradient gel (Bio-Rad, Hercules, CA) run under native
conditions and then stained with Coomassie blue. In Figure 35, lanes 1 and 4 contain proteins
present in peak 1 and peak 2 from a Sephacryl S-100 column run as described below; lane 2
is blank.

As seen in lane 3 of Figure 35, the IDA-purified sample consists primarily of the
folding chaperones and the BotB protein. The fact that the chaperones and the BotB antigen
appear as two distinct bands under native conditions suggested they were not complexed
together and therefore, it should be possible to separate them, using either a gradient of
imidazole concentrations or size exclusion methods.

In order to determine whether a gradient of imidazole concentrations could be used to
separate the chaperone from the BotB protein, a step gradient using imidazole at 200, 400,
600, and 800 mM in 50 mM sodium phosphate, 0.5 M NaCl and 10% glycerol, pH 6.8 was
applied to an IDA column (containing proteins bound from a lysate of pHisBotB kan lacIq
T7/pACYCGro/BL21(DE3) cells). By narrowing the range of imidazole concentrations, it
was hoped that the BotB and chaperone proteins would differentially elute at different
concentrations of imidazole. Eluted proteins were monitored by absorbance at 280 nm and
collected either manually or with a fraction collector (BioRad). Protein was found to elute at
200 and 400 mM imidazole only.

Figure 36 shows a Coomassie stained SDS-PAGE gel containing protein eluted during
the imidazole step gradient. Lane 1 contains broad range MW markers (BioRad). Lane 2
contains BotB protein purified by IDA chromatography of an extract of pHisBotB/BL21(DE3)
pLysS cells grown in shaker flask culture (i.e., no co-expression of chaperones; Ex. 35).
Lane 3 contains a 20% w/v PEI clarified lysate of pHisBotB kan lacIq
T7/pACYCGro/BL21(DE3) cells (i.e., the lysate prior to purification by IDA
chromatography). Lanes 4 and 5 contain protein which eluted at 200 or 400 mM imidazole,
respectively. Lane 6 is blank. Lanes 7 and 8 contain 1/5 the load present in lanes 4 and 5.

As shown in Figure 36, both the chaperone and the BotB protein eluted in 200 mM
imidazole, and more chaperone eluted in 400 mM imidazole; however, no concentration of
imidazole tested permitted the elution of BotB protein alone. Consequently, no significant
purification was achieved using imidazole at these concentrations.

Because of the considerable difference in molecular weights between the folding
chaperone, which is a multimer with a total molecular weight around 400 kD (as determined
on a Shodex KB 804 sizing column by HPLC), and the recombinant BotB protein (molecular
weight around 50 kD), size exclusion chromatography was next examined for the ability to
separate these proteins.

a) Size Exclusion Chromatography 

A column containing Sephacryl S-100 HR (S-100) (Pharmacia) was poured (2.5 cm x
24 cm; 110 ml bed volume). The column was equilibrated in a buffer consisting of
phosphate buffered saline (10 mM potassium phosphate, 150 mM NaCl, pH 7.2) and 10%
glycerol (Mallinckrodt). Typically, 5 ml of the IDA-purified BotB protein was filtered through
a 0.45 µ syringe filter and applied to the column, and the equilibration buffer was pumped
through the column at a flow rate of 1 ml/minute. Eluted proteins were monitored by
absorbance at 280 nm and collected either manually or with a fraction collector. Appropriate
tubes were pooled, if necessary, and the protein was quantitated by absorbance at 280 nm
and/or by BCA protein assay. The isolated peaks were then analyzed by native and/or
SDS-PAGE to identify the protein and evaluate the purity.

Because of its larger size, the folding chaperone eluted first, followed by the
recombinant BotB protein. A smaller third peak was observed which failed to stain when
analyzed by SDS-PAGE and therefore was presumed to be imidazole.

SDS-PAGE analysis (12.5% polyacrylamide, reduced samples) was used to evaluate
the purity of the IDA-purified recombinant BotB protein before and after S-100 purification.
The results are shown in Figure 33.

In Figure 33, lane 1 contains broad range MW markers (BioRad). Lane 5 contains
IDA-purified BotB protein. Lane 6 contains IDA-purified BotB protein following S-100
purification. Lane 7 is blank (lanes 2-4 were discussed in Ex. 34 above).

The results shown in Figure 33 show that the IDA-purified BotB is significantly
contaminated with the folding chaperone (molecular weight about 60 kD under reducing
conditions; lane 6). Following S-100 purification, the amount of folding chaperone present in
the BotB sample was reduced dramatically (lane 7). Visual inspection of the Coomassie
stained SDS-PAGE gel revealed that after S-100 purification, > W% of the total protein
present was BotB.

The IDA-purified BotB and the S-100-purified BotB samples were analyzed by HPLC
on a size exclusion column (Shodex KB 804); this analysis revealed that the BotB protein
represented 64% of the total protein in the IDA-purified sample and that following S-100
purification, the BotB protein represented >95% of the total protein in the sample.

The IDA-purified BotB material was also applied to an ACA 44 (Spectra/Por, Houston,
TX) column. The ACA 44 resin is equivalent to the S-100 resin and chromatography using
the ACA 44 resin was carried out exactly as described above for the S-100 resin. The ACA
44 resin was found to separate the recombinant BotB protein from the folding chaperone.
The ACA 44-purified BotB sample was analyzed for endotoxin using the LAL assay
(Associates of Cape Cod) as described in Example 24. Two aliquots of the ACA 44-purified
BotB preparation were analyzed and were found to contain either 58 to 116 EU/mg
recombinant protein or 94 to 189 EU/mg recombinant protein.

These results demonstrate that size exclusion chromatography can be used to purify the
recombinant BotB protein from the folding chaperone and imidazole in IDA-purified material.

b) Ultrafiltration For The Separation Of Recombinant BotB
Protein And Chaperones

Ultrafiltration was examined as an alternative method for the separation of recombinant
BotB protein and folding chaperones in IDA-purified material. While in this example only
mixtures of BotB and chaperones were separated by ultrafiltration, this technique is suitable
for use with recombinant BotA and BotE proteins as well provided that the wash buffers used
are altered as necessary to take into account different requirements for solubility.

The recombinant BotB protein and folding chaperones were separated using a two-step
sequential ultrafiltration method. The first membrane used had a nominal molecular weight
cutoff (MWCO) of approximately 100 kD; this membrane retains the larger folding chaperone
while allowing the smaller recombinant protein to pass through. The addition of several
volumes of wash buffer may be required to efficiently wash the recombinant protein through
the membrane. The second step utilized a membrane with a nominal MWCO of
approximately 10 kD. During this step, the recombinant antigen was retained by the
membrane and could be concentrated to the degree desired and the imidazole and excess wash
buffer passed through the membrane.
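
The logic of the two-step method can be illustrated with an idealized model in which each membrane acts as a sharp size threshold. This is a simplification, since real membranes have gradual retention curves and nominal cutoffs. The molecular weights of the chaperone (about 400 kD) and BotB (about 50 kD) are those given in this example; the value for imidazole (about 0.068 kD) is a reference value not stated in the text, and all names in the sketch are hypothetical.

```python
def fractionate(species_kd, first_mwco_kd=100.0, second_mwco_kd=10.0):
    """Assign each species to a fraction, treating each MWCO as a sharp threshold."""
    fractions = {}
    for name, mw_kd in species_kd.items():
        if mw_kd >= first_mwco_kd:
            fractions[name] = "first retentate"    # held back by the 100 kD membrane
        elif mw_kd >= second_mwco_kd:
            fractions[name] = "second retentate"   # passes 100 kD, held by 10 kD
        else:
            fractions[name] = "final filtrate"     # passes both membranes
    return fractions


sample = {
    "folding chaperone": 400.0,  # multimer, from this example
    "BotB": 50.0,                # from this example
    "imidazole": 0.068,          # assumed reference value
}
for name, fraction in fractionate(sample).items():
    print(f"{name}: {fraction}")
```

In this idealized picture the chaperone stays in the first retentate, BotB is recovered and concentrated in the second retentate, and imidazole leaves with the final filtrate.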

Twenty-seven milliliters of an IDA-purified BotB preparation was ultrafiltered through
a 47 mm YM 100 (100 kD MWCO) membrane (Amicon) in a 50 ml stirred cell (Amicon).
The membrane was washed in dd H2O prior to use as recommended by the manufacturer. Six
volumes of 10% glycerol in PBS were washed through to remove most of the recombinant
BotB protein and this wash was collected in a separate vessel. The resulting BotB
protein-rich filtrate was then concentrated 12-fold using a YM 10 (10 kD MWCO) membrane
(Amicon), to a final volume of 14 ml. The YM 100 and YM 10 concentrates were analyzed
along with the lysate starting material by native PAGE using a 4-15% pre-cast gradient gel
(BioRad). The results are shown in Figure 37.

In Figure 37, lane 1 contains IDA-purified BotB derived from a shaker flask culture
(i.e., no co-expression of chaperones; Ex. 35); lane 2 contains a 20% w/v PEI clarified lysate
of pHisBotB kan lacIq T7/pACYCGro/BL21(DE3) cells; lane 3 shows the lysate of lane 2
after IDA purification; lane 4 contains the YM 10 concentrate and lane 5 contains the YM
100 concentrate.

The results shown in Figure 37 demonstrate that the recombinant BotB protein can be
purified away from the folding chaperone by ultrafiltration through a 100 kD MWCO
membrane (lane 4), leaving the chaperone protein in the 100 kD concentrate (lane 5).
Analysis of the sample in lane 5 also showed that very little of the BotB protein was retained
by the 100 kD MWCO membrane after 6 volumes of wash buffer had been applied.

The BotB samples following IDA chromatography and following ultrafiltration through
the YM 100 membrane were analyzed by HPLC on a size exclusion column (Shodex KB 804);
this analysis revealed that the BotB protein represented 64% of the total protein in the
IDA-purified sample and that following ultrafiltration through the YM 100 membrane, the BotB
protein represented >96% of the total protein in the sample.

The BotB protein purified by ultrafiltration through the YM 100 membrane was
examined for endotoxin using the LAL assay (Associates of Cape Cod) as described in
Example 24. Two aliquots of the YM 100-purified BotB preparation were analyzed and were
found to contain either 18 to 36 EU/mg recombinant protein or 125 to 250 EU/mg
recombinant protein.

The above results demonstrate that size exclusion chromatography and ultrafiltration
can be used to purify recombinant botulinal toxin proteins away from folding chaperones.

EXAMPLE 41 

Cloning And Expression Of The C Fragment 
Of The C. botulinum Serotype E Toxin Gene

The C. botulinum type E neurotoxin gene has been cloned and sequenced from several
different strains [Poulet et al. (1992) Biochem. Biophys. Res. Commun. 183:107 (strain
Beluga); Whelan et al. (1992) Eur. J. Biochem. 204:657 (strain NCTC 11219); Fujii et al.
(1990) Microbiol. Immunol. 34:1041 (partial sequence of strains Mashike, Iwanai and Otaru)
and Fujii et al. (1993) J. Gen. Microbiol. 139:79 (strain Mashike)]. The nucleotide sequence
of the type E toxin gene is available from the EMBL sequence data bank under accession
numbers X62089 (strain Beluga) and X62683 (strain NCTC 11219). The nucleotide sequence
of the coding region (strain Beluga) is listed in SEQ ID NO:49. The amino acid sequence of
the C. botulinum type E neurotoxin derived from strain Beluga is listed in SEQ ID NO:50.
The nucleotide sequence of the coding region (strain NCTC 11219) is listed in SEQ ID
NO:51. The amino acid sequence of the C. botulinum type E neurotoxin derived from strain
NCTC 11219 is listed in SEQ ID NO:52.

The DNA sequence encoding the native C. botulinum serotype E C fragment gene
derived from the Beluga strain can be expressed as a histidine-tagged protein using the
pETHisb vector; the resulting coding region is listed in SEQ ID NO:53 and the corresponding
amino acid sequence is listed in SEQ ID NO:54. The DNA sequence encoding the C
fragment of the native C. botulinum serotype E gene derived from the NCTC 11219 strain
can be expressed as a histidine-tagged fusion protein using the pETHisb vector; the resulting
coding region is listed in SEQ ID NO:55 and the corresponding amino acid sequence is listed
in SEQ ID NO:56. The C fragment region from any strain of C. botulinum serotype E can be
amplified and expressed using the approach illustrated below using the C fragment derived
from C. botulinum type E 2231 strain (ATCC 17786).

The type E neurotoxin gene is synthesized as a single polypeptide chain which may be
converted to a double-chain form (i.e., a heavy chain and a light chain) by cleavage with
trypsin; unlike the type A neurotoxin, the type E neurotoxin exists essentially only in the
single-chain form. The 50 kD carboxy-terminal portion of the heavy chain is referred to as
the C fragment or the HC domain. Expression of the C fragment of C. botulinum type E toxin
in heterologous hosts (e.g., E. coli) has not been previously reported.

The native C fragment of the C. botulinum serotype E toxin (BotE) gene was cloned
and inserted into expression vectors to facilitate expression of the recombinant BotE protein in
E. coli. This example involved PCR amplification of the gene, cloning, and construction of
expression vectors.

The BotE serotype gene was isolated using PCR as described for the BotA serotype
gene in Example 28. The C. botulinum type E strain was obtained from the American Type
Culture Collection (ATCC #17786; strain 2231). The following primer pair was used in the
PCR amplification: 5'-CGCCATGGCTCTTTCTTCTTATACAGATGAT-3' (5' primer,
engineered NcoI site underlined) (SEQ ID NO:57) and
5'-GCAAGCTTTTATTTTTCTTGCCATCCATG-3' (3' primer, engineered HindIII site
underlined, native gene termination codon italicized) (SEQ ID NO:58). The PCR product was
inserted into pCRscript as described in Example 28. The resulting pCRscript BotE clone was
confirmed by restriction digestion, as well as by obtaining the sequence of approximately 300
bases located at the 5' end of the C fragment coding region using standard DNA sequencing
methods. The resulting BotE sequence was identical to that of the published C. botulinum
type E toxin sequence [Whelan et al. (1992), supra].
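
The engineered features of the primer pair can be checked programmatically. The sketch below assumes the primer sequences as transcribed from this example (SEQ ID NO:57 and SEQ ID NO:58); the recognition sequences CCATGG (NcoI) and AAGCTT (HindIII) and the standard genetic code are reference facts, and the helper names are hypothetical.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Primer sequences as transcribed from the example (an assumption of this sketch).
FORWARD = "CGCCATGGCTCTTTCTTCTTATACAGATGAT"
REVERSE = "GCAAGCTTTTATTTTTCTTGCCATCCATG"

print("NcoI site in 5' primer:", "CCATGG" in FORWARD)
print("HindIII site in 3' primer:", "AAGCTT" in REVERSE)
# Reading frame set by the ATG within the NcoI site.
print("5' primer encodes:", translate(FORWARD[FORWARD.index("ATG"):]))
# The 3' primer anneals to the coding strand, so its reverse complement is read.
print("3' primer encodes:", translate(reverse_complement(REVERSE)).split("*")[0])
```

Under these assumptions the 5' primer supplies an in-frame initiator methionine within the NcoI site, and the reverse complement of the 3' primer ends the open reading frame with a TAA stop codon immediately ahead of the HindIII site.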

The NcoI(filled)/HindIII fragment from a pCRscript BotE recombinant was cloned into
pETHisb vector as described for BotA C fragment in Example 28. The resulting construct
was termed pHisBotE. pHisBotE expresses the BotE gene under the control of the T7 lac
promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag.

The pHisBotE expression construct was transformed into BL21(DE3) pLysS competent
cells and 1 liter cultures were grown, induced and his-tagged proteins were purified utilizing a
NiNTA resin (eluted in low pH elution buffer) as described in Example 28. Total, soluble
and purified proteins were resolved by SDS-PAGE and detected by Coomassie staining. The
results are shown in Figure 38.

In Figure 38, lane 1 contains broad range MW markers (BioRad); lane 2 contains a
total protein extract; lane 3 contains a soluble protein extract; lane 4 contains proteins present
in the flow through from the NiNTA column (this sample was not diluted prior to loading and
therefore represents a load 5X that of the load applied for the total and soluble extracts in
lanes 2 and 3); lane 5 contains proteins eluted from the NiNTA column; lane 6 contains
protein eluted from a NiNTA column which had been stored at -20°C for 1 year.

The pHisBotE protein was expressed at moderate levels (7 mg/liter) as a totally soluble
protein. The purified protein migrated as a single band of the predicted MW.

Western blot hybridization utilizing a chicken anti-C. botulinum serotype E toxoid
primary antibody (generated by immunization of hens as described in Example 3 using C.
botulinum serotype E toxoid) was also performed on the total, soluble and purified BotE
proteins. Samples of BotA and BotB C fragments were also included on the gels to facilitate
MW and immunogenicity comparisons. Strong immunoreactivity was detected using the
anti-C. botulinum type E toxoid antibody only with the BotE protein.

These results demonstrate that the native BotE gene sequences can be expressed as a
soluble his-tagged protein in E. coli and purified by metal-chelation affinity chromatography.

EXAMPLE 42 

Generation Of Neutralizing Antibodies Using The Recombinant pHisBotE Protein

The ability of the purified pHisBotE protein to generate neutralizing antibodies was
examined. Nine BALBc mice were immunized with BotE protein (purified as described in
Ex. 41) using Gerbu GMDP adjuvant (CC Biotech). The low pH elution was mixed with
Gerbu adjuvant and used to immunize mice. Each mouse received a subcutaneous injection
of 100 µl antigen/adjuvant mix (35 µg antigen + 1 µg adjuvant) on day 0. Mice were
subcutaneously boosted as above on day 14 and bled on day 28. Mice were subsequently
boosted and bled on day 70.

Anti-C. botulinum serotype E toxoid titers were determined in day 28 serum from
individual mice from each group using the ELISA protocol outlined in Example 29 with the
exception that the plates were coated with C. botulinum serotype E toxoid, and the primary
antibody was a chicken anti-C. botulinum serotype E toxoid. Seroconversion [relative to
control mice immunized with the p6xHisBotA antigen (Ex. 29)] was observed with all 9 mice
immunized with the purified pHisBotE protein.

The ability of the anti-BotE antibodies to neutralize native C. botulinum type E toxin
was tested in a mouse-C. botulinum neutralization model using pooled mouse serum (see Ex.
23b). The LD50 of purified C. botulinum type E toxin complex (Dr. Eric Johnson, University
of Wisconsin, Madison) was determined by an intraperitoneal (IP) method [Schantz and Kautter
(1978), supra] using 18-22 g female ICR mice. The amount of neutralizing antibodies present
in the serum of the immunized mice was determined using serum antibody titrations. The
various serum dilutions (0.01 ml) were mixed with 5 LD50 units of C. botulinum type E toxin
and the mixtures were injected IP into mice. The neutralizations were performed in duplicate.
The mice were then observed for signs of botulism for 4 days. Undiluted serum from day 28
did not protect. Undiluted, 1/10 diluted and 1/100 diluted day 70 serum protected (100%
of animals), while 1/1000 diluted day 70 serum did not. This corresponds to a neutralization
titer of 50-500 IU/ml.
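
The titer range quoted above can be checked with a short calculation. The sketch below is illustrative only: it assumes that 1 IU of type E antitoxin neutralizes 1,000 mouse LD50 (a conversion factor that is not stated in this example), and the function and variable names are hypothetical.

```python
def titer_iu_per_ml(ld50_challenge, serum_ml, dilution_factor, ld50_per_iu):
    """Neutralizing titer of the undiluted serum implied by one test mixture."""
    # Toxin units neutralized per ml of undiluted serum.
    ld50_per_ml = ld50_challenge * dilution_factor / serum_ml
    return ld50_per_ml / ld50_per_iu


# Assumption (not from the text): 1 IU neutralizes 1,000 mouse LD50 of type E toxin.
LD50_PER_IU_TYPE_E = 1000

# 0.01 ml of 1/100 diluted day 70 serum protected against 5 LD50;
# 0.01 ml of 1/1000 diluted serum did not.
lower_bound = titer_iu_per_ml(5, 0.01, 100, LD50_PER_IU_TYPE_E)
upper_bound = titer_iu_per_ml(5, 0.01, 1000, LD50_PER_IU_TYPE_E)

print(f"titer between {lower_bound:.0f} and {upper_bound:.0f} IU/ml")
```

Under that assumption the last protective dilution sets the lower bound and the first non-protective dilution sets the upper bound of the reported range.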

These results demonstrate that seroconversion occurred and neutralizing antibodies
were induced when the recombinant BotE protein was utilized as the immunogen.

EXAMPLE 43 

Construction Of Vectors To Facilitate Expression 
Of His-Tagged BotE Protein In Fermentation Cultures

A number of expression vectors were constructed to facilitate the expression of
recombinant BotE protein in large scale fermentation culture. These constructs varied as to
the strength of the promoter utilized (T7 or T7lac) and the presence of repressor elements
(lacIq) on the plasmid. The resulting constructs varied in the level of expression achieved and
in plasmid stability, which facilitated the selection of an optimal expression system for
fermentation scaleup. This example involved a) construction of BotE expression vectors and
b) determination of expression levels in small scale cultures.

a) Construction Of BotE Expression Vectors

The BotE expression vectors created for fermentation culture were engineered to
utilize the kanamycin rather than the ampicillin resistance gene, and contained either the T7 or
T7lac promoter, with or without the lacIq gene for the reasons outlined in Example 30.

In all cases, the protein expressed by the various expression vectors is the pHisBotE
protein described in Example 41, with the only differences between clones being the alteration
of various regulatory elements. Using the designations outlined below, the pHisBotE clone
(Ex. 41) is equivalent to pHisBotE amp T7lac.

i) Construction Of pHisBotE kan lacIq T7lac

pHisBotE kan lacIq T7lac was constructed by inserting the XbaI/HindIII fragment of
pHisBotE which contains the BotE gene sequences into XbaI/HindIII-cleaved pET24a vector.
Proper construction was confirmed by restriction digestion.

ii) Construction Of pHisBotE kan T7

pHisBotE kan T7 was constructed by ligating the BotE-containing XbaI/SapI fragment
of pHisBotE kan lacIq T7lac to the T7 promoter-containing XbaI/SapI fragment of pET23a.
Proper construction was confirmed by restriction digestion.

iii) Construction Of pHisBotE kan lacIq T7

pHisBotE kan lacIq T7 was constructed by inserting the BglII/HindIII fragment from
pHisBotE kan T7 which contains the BotE gene sequences into BglII/HindIII-cleaved pET24
vector. Proper construction was confirmed by restriction digestion.

b) Determination Of BotE Expression Levels In Small Scale 
Cultures 

The three BotE kan expression vectors described above were transformed into
BL21(DE3) competent cells and 50 ml (2XYT + 40 µg/ml kan) cultures were grown and
induced with IPTG as described in Example 28. Total and soluble protein extracts from
before and after induction were made as described in Example 28. The total and soluble extracts
were resolved on a 12.5% SDS-PAGE gel, and his-tagged proteins were detected on a
Western blot utilizing the NiNTA-alkaline phosphatase conjugate as described in Example
31(c)(iii). The results showed that all three BotE cell lines expressed his-tagged proteins of
the predicted MW for the BotE protein upon induction. The results also demonstrated that
the two constructs that contained the T7 promoter expressed the BotE protein before
induction, while the T7lac promoter construct did not. Upon induction, the T7
promoter-containing constructs induced to higher levels than the T7lac-containing construct, with the
pHisBotE kan lacIq T7/BL21(DE3) cells accumulating the maximal levels of BotE protein.

EXAMPLE 44

Expression And Purification Of pHisBotE From Fermentation Cultures 

Based on the small scale inductions performed in Example 43, the pHisBotE kan lacIq
T7/BL21(DE3) strain was selected for fermentation scaleup. This example involved the
fermentation and purification of recombinant BotE C fragment protein.

A fermentation with the pHisBotE kan lacIq T7/BL21(DE3) strain was performed as
described in Example 31. The fermentation culture was induced 2 hrs post start of the
glucose feed with 4 gm IPTG (final concentration = 1.6 mM). The OD600 was 42 at time of
induction, then 46.5, 48, 53 and 54 at 1-4 hrs post induction. Viable colony counts decreased
from 0-4 hr induction [131, 4 (28), 7 (3), 7, 8; dilution 3 utilized 6 µl of dilution 2 cells;
bracketed colonies are microcolonies]. All (32/32) colonies scored at the time of induction
retained the BotE plasmid (kan resistant) and no colonies at induction grew on IPTG+Kan
plates (no mutations detected). These results were indicative of strong promoter induction,
since colony viability was reduced after induction, and the culture stopped growing during
fermentation (stopped at 54 OD600/ml).

Total and soluble extracts were resolved on a 12.5% SDS-PAGE gel and total protein
was detected by staining with Coomassie blue. The results are shown in Figure 39.

In Figure 39, lane 1 contains total protein from a pHisBotA kan T7lac/BL21(DE3)
pLysS fermentation (Ex. 24). Lanes 2-9 contain extracts prepared from the above pHisBotE
kan lacIq T7/BL21(DE3) fermentation; lanes 2-4 contain total protein extracts prepared at 0, 1
and 2 hours post-induction, respectively. Lane 5 contains a soluble protein extract prepared at
2 hours post-induction. Lanes 6 and 7 contain total and soluble extracts prepared at 3 hours
post-induction, respectively. Lanes 8 and 9 contain total and soluble extracts prepared at 4
hours post-induction, respectively. Lane 10 contains broad range MW markers (BioRad).

The results shown in Figure 39 demonstrate that moderate level induction of totally
soluble BotE protein was observed, increasing from 1 to 4 hrs post induction (no expression
was detected in uninduced cells). From a 2 liter fermentation harvest a 155 gm (wet wt) cell
pellet was obtained and used to make a PEI-clarified lysate (1 liter in CRB, pH 6.8). The
lysate was applied to a large scale IDA column and 200 mg of BotE protein, which was
found to be greater than 95% pure (as judged by visual inspection of a Coomassie stained
SDS-PAGE gel), was recovered. This represents 2.5% of the total soluble cellular protein
(assuming a PEI lysate having a concentration of 8 mg protein/ml) and corresponds to a yield
of 100 mg BotE protein/liter of fermentation culture.
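
The two figures quoted above (2.5% of total soluble protein and 100 mg/liter) follow directly from the stated quantities. A minimal sketch of the arithmetic, with illustrative variable names and the 8 mg protein/ml lysate concentration taken as the assumption stated in the text:

```python
# Quantities stated in the text.
harvest_volume_liters = 2.0      # fermentation harvest
lysate_volume_ml = 1000.0        # PEI-clarified lysate (1 liter)
recovered_bote_mg = 200.0        # BotE recovered from the IDA column

# Assumed in the text rather than measured here: lysate protein concentration.
assumed_lysate_mg_per_ml = 8.0

total_soluble_protein_mg = lysate_volume_ml * assumed_lysate_mg_per_ml
fraction_of_soluble_protein = recovered_bote_mg / total_soluble_protein_mg
yield_mg_per_liter = recovered_bote_mg / harvest_volume_liters

print(f"{fraction_of_soluble_protein:.1%} of total soluble protein")
print(f"{yield_mg_per_liter:.0f} mg BotE per liter of fermentation culture")
```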

The above results demonstrate that high levels of the recombinant BotE protein can be 
expressed and purified from fermentation cultures. 

EXAMPLE 45 

Removal Of Imidazole From Purified Recombinant BotE Protein Preparations

The expression of recombinant BotE protein, unlike the BotA and BotB proteins, did
not require the presence of folding chaperones to maintain solubility during scale-up. A size
exclusion chromatography step was included, however, to remove the imidazole from the
sample and exchange the IDA elution buffer for one consistent with the BotA antigen.

A Sephacryl S-100 HR (S-100; Pharmacia) column was poured (2.5 cm x 24 cm; bed
volume ~110 ml). Under these conditions, the BotE protein should be retained by the beads
to a lesser degree than the smaller imidazole; therefore the BotE protein should elute from the
column before the imidazole. The column was equilibrated in a buffer consisting of 50 mM
sodium phosphate, 0.5 M NaCl, and 10% glycerol (all reagents from Mallinckrodt). Five
milliliters of the IDA-purified BotE protein (Ex. 44) was filtered through a 0.45 µm syringe
filter and applied to the S-100 column, and equilibration buffer was pumped through the
column at a flow rate of 1 ml/minute. Eluted proteins were monitored by absorbance at 280
nm, and collected either manually or with a fraction collector. Appropriate tubes were pooled
if necessary, and the protein was quantitated by absorbance at 280 nm and/or BCA protein
assay. The isolated peaks were then analyzed by native and/or SDS-PAGE to identify the
protein(s) and evaluate the purity.

Figure 40 provides a representative chromatogram showing the purification of
IDA-purified BotE on the S-100 column. Even though folding chaperones were not
over-expressed with this antigen, a small amount of protein eluted at a time consistent with
the folding chaperones expressed with BotA and BotB proteins (Gro) (see the first peak). The
second peak in the chromatogram contained the BotE protein, and the third peak was
presumably imidazole. This presumed imidazole peak was isolated in comparable levels in
IDA-purified BotA and BotB protein preparations as well.

These results demonstrate that size exclusion chromatography can be used to remove
imidazole and traces of contaminating high molecular weight proteins from IDA-purified
BotE protein preparations.

The S-100-purified BotE protein was tested for endotoxin contamination using the
LAL assay as described in Example 24. This preparation was found to contain 64 to 128
EU/mg recombinant protein and is therefore substantially free of endotoxin.

The S-100 purified BotE was mixed with purified preparations of BotA and BotB
proteins and used to immunize mice; 5 µg of each Bot protein was used per immunization
and alum was included as an adjuvant. After two immunizations with this trivalent vaccine,
the immunized mice were challenged with C. botulinum toxin. The immunized mice
contained neutralizing antibodies sufficient to neutralize between 100,000 and 1,000,000 LD50
of either toxin A or toxin B and between 1,000 and 10,000 LD50 of toxin E. The titer of
neutralizing antibodies directed against toxin E would be expected to increase following
subsequent boosts with the vaccine. These results demonstrate that a trivalent vaccine
containing recombinant BotA, BotB and BotE proteins provokes neutralizing antibodies.

EXAMPLE 46 

Expression Of The C Fragment Of The C. botulinum
Serotype C Toxin Gene And Generation Of Neutralizing Antibodies

The C. botulinum type C1 neurotoxin gene has been cloned and sequenced [Kimura et
al. (1990) Biochem. Biophys. Res. Comm. 171:1304]. The nucleotide sequence of the toxin
gene derived from the C. botulinum type C strain C-Stockholm is available from the
EMBL/GenBank sequence data banks under the accession number D90210; the nucleotide
sequence of the coding region is listed in SEQ ID NO:59. The amino acid sequence of the C.
botulinum type C1 neurotoxin derived from this strain is listed in SEQ ID NO:60.

The DNA sequence encoding the native C. botulinum serotype C1 C fragment gene
derived from the C-Stockholm strain can be expressed using the pETHisb vector; the resulting
coding region is listed in SEQ ID NO:61 and the corresponding amino acid sequence is listed
in SEQ ID NO:62. The C fragment region from any strain of C. botulinum serotype C can be
amplified and expressed using the approach illustrated below using the C fragment derived
from C. botulinum type C C-Stockholm strain. Expression of the C fragment of C. botulinum
type C1 toxin in heterologous hosts (e.g., E. coli) has not been previously reported.

The C fragment of the C. botulinum serotype C1 (BotC1) toxin gene is cloned using
the protocols and conditions described in Example 28 for the isolation of the native BotA
gene. A number of C. botulinum serotype C strains (expressing either or both C1 and C2
toxin) are available from the ATCC [e.g., 2220 (ATCC 17782), 2239 (ATCC 17783), 2223
(ATCC 17784; a type C-β strain; C-β strains produce C2 toxin), 662 (ATCC 17849; a type
C-α strain; C-α strains produce mainly C1 toxin and a small amount of C2 toxin), 2021
(ATCC 17850; a type C-α strain) and VPI 3803 (ATCC 25766)]. Alternatively, other type C
strains may be employed for the isolation of sequences encoding the C fragment of C.
botulinum serotype C toxin.

The following primer pair is used to amplify the BotC gene:
5'-CG CCATGG CTTTATTAAAAGATATAATTAATG-3' [5' primer, engineered NcoI site underlined (SEQ ID
NO:63)] and 5'-GC AAGCTT TTA TTCACTTACAGGTACAAAACC-3' [3' primer,
engineered HindIII site underlined, native gene termination codon italicized (SEQ ID
NO:64)]. Following PCR amplification, the PCR product is inserted into the pCRscript vector
and then the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment
gene in Example 28. The resulting construct is termed pHisBotC. Proper construction is
confirmed by DNA sequencing of the BotC sequences contained within pHisBotC.

pHisBotC expresses the BotC gene sequences under the transcriptional control of the
T7 lac promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag. The
pHisBotC expression construct is transformed into BL21(DE3) pLysS competent cells and 1
liter cultures are grown and induced, and his-tagged proteins are purified utilizing a NiNTA resin
(eluted in 250 mM imidazole, 20% glycerol) as described in Example 28. Total, soluble and
purified proteins are resolved by SDS-PAGE and detected by Coomassie staining and Western
blot hybridization utilizing a Ni-NTA-alkaline phosphatase conjugate (Qiagen) which
recognizes his-tagged proteins as described in Example 31(c)(iii). This analysis permits the
determination of expression levels of the pHisBotC protein (i.e., number of mg/liter expressed
as a soluble protein). The purified BotC protein will migrate as a single band of the predicted
MW (i.e., ~50 kD).

The level of expression of the pHisBotC protein may be modified (increased) by
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the lacIq gene on
the expression plasmid, and the plasmid expressed in BL21(DE3) cell lines in fermentation
cultures as described in Example 30. If only very low levels (i.e., less than 0.5%) of soluble
pHisBotC protein are expressed using the above expression systems, the pHisBotC construct
may be co-expressed with pACYCGro construct as described in Example 32. In this case, the
recombinant BotC protein may co-purify with the folding chaperones. The contaminating
chaperones may be removed as described in Example 34. Preparations of purified pHisBotC
protein are tested for endotoxin contamination using the LAL assay as described in Example
24.

The purified pHisBotC protein is used to generate neutralizing antibodies. BALBc
mice are immunized with the BotC protein using Gerbu GMDP adjuvant (CC Biotech) as
described in Example 36. The ability of the anti-BotC antibodies to neutralize native C.
botulinum type C toxin is demonstrated using the mouse-C. botulinum neutralization model
described in Example 36.

EXAMPLE 47 

Expression Of The C Fragment Of The C. botulinum
Serotype D Toxin Gene And Generation Of Neutralizing Antibodies

The C. botulinum type D neurotoxin gene has been cloned and sequenced [Sunagawa
et al. (1992) J. Vet. Med. Sci. 54:905 and Binz et al. (1990) Nucleic Acids Res. 18:5556].
The nucleotide sequence of the toxin gene derived from the CB16 strain is available from the
EMBL/GenBank sequence data banks under the accession number S49407; the nucleotide
sequence of the coding region is listed in SEQ ID NO:65. The amino acid sequence of the C.
botulinum type D neurotoxin derived from the CB16 strain is listed in SEQ ID NO:66.

The DNA sequence encoding the native C. botulinum serotype D C fragment gene
derived from a BotD expressing strain can be expressed using the pETHisb vector; the
resulting coding region is listed in SEQ ID NO:67 and the corresponding amino acid sequence
is listed in SEQ ID NO:68. The C fragment region from any strain of C. botulinum serotype
D can be amplified and expressed using the approach illustrated below using the C fragment
derived from C. botulinum type D CB16 strain. Expression of the C fragment of C.
botulinum type D toxin in heterologous hosts (e.g., E. coli) has not been previously reported.

The C fragment of the C. botulinum serotype D (BotD) toxin gene is cloned using the
protocols and conditions described in Example 28 for the isolation of the native BotA gene.
A number of C. botulinum type D strains are available from the ATCC [e.g., ATCC 9633,
2023 (ATCC 17851), and VPI 5995 (ATCC 27517)].

The following primer pair is used to amplify the BotD gene:
5'-CG CCATGG CTTTATTAAAAGATATAATTAATG-3' [5' primer, engineered NcoI site underlined (SEQ ID
NO:63)] and 5'-GC AAGCTT TTA CTCTACCCATCCTGGATCCCT-3' [3' primer, engineered
HindIII site underlined, native gene termination codon italicized (SEQ ID NO:69)].
Following PCR amplification, the PCR product is inserted into the pCRscript vector and then
the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in
Example 28. The resulting construct is termed pHisBotD.
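
The design features of the BotD primer pair (engineered restriction sites, an in-frame start codon supplied by the NcoI site, and the native termination codon on the 3' primer) can be checked mechanically. The sketch below is illustrative only; the primer strings are transcribed from the text above with spacing removed, and the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


# Primers as transcribed from the text (SEQ ID NO:63 and SEQ ID NO:69).
forward = "CGCCATGGCTTTATTAAAAGATATAATTAATG"
reverse = "GCAAGCTTTTACTCTACCCATCCTGGATCCCT"

NCOI, HINDIII = "CCATGG", "AAGCTT"

nco_pos = forward.find(NCOI)      # 0-based position of the engineered NcoI site
hind_pos = reverse.find(HINDIII)  # 0-based position of the engineered HindIII site

# The ATG embedded in the NcoI site (CC-ATG-G) supplies the start codon;
# split the rest of the 5' primer into codons in that reading frame.
start = nco_pos + 2
codons = [forward[i:i + 3] for i in range(start, len(forward) - 2, 3)]

# The three bases after the HindIII site on the 3' primer are the reverse
# complement of the termination codon on the coding strand.
stop_codon = reverse_complement(reverse[hind_pos + 6:hind_pos + 9])

print("NcoI site at", nco_pos, "| first codons:", " ".join(codons[:4]))
print("HindIII site at", hind_pos, "| coding-strand stop codon:", stop_codon)
```

The same check applies to the other serotype primer pairs by substituting their sequences and restriction sites.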

pHisBotD expresses the BotD gene sequences under the transcriptional control of the
T7 lac promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag. The
pHisBotD expression construct is transformed into BL21(DE3) pLysS competent cells and 1
liter cultures are grown and induced, and his-tagged proteins are purified utilizing a NiNTA resin
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE
and detected by Coomassie staining and Western blot hybridization utilizing a
Ni-NTA-alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in
Example 31(c)(iii). This analysis permits the determination of expression levels of the
pHisBotD protein (i.e., number of mg/liter expressed as a soluble protein). The purified BotD
protein will migrate as a single band of the predicted MW (i.e., ~50 kD).

The level of expression of the pHisBotD protein may be modified (increased) by
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the lacIq gene on
the expression plasmid, and the plasmid expressed in BL21(DE3) cell lines in fermentation
cultures as described in Example 30. If only very low levels (i.e., less than about 0.5%) of
soluble pHisBotD protein are expressed using the above expression systems, the pHisBotD
construct may be co-expressed with pACYCGro construct as described in Example 32. In
this case, the recombinant BotD protein may co-purify with the folding chaperones. The
contaminating chaperones may be removed as described in Example 34. Preparations of
purified pHisBotD protein are tested for endotoxin contamination using the LAL assay as
described in Example 24.

The purified pHisBotD protein is used to generate neutralizing antibodies. BALBc
mice are immunized with the BotD protein using Gerbu GMDP adjuvant (CC Biotech) as
described in Example 36. The ability of the anti-BotD antibodies to neutralize native C.
botulinum type D toxin is demonstrated using the mouse-C. botulinum neutralization model
described in Example 36.

EXAMPLE 48

Expression Of The C Fragment Of The C. botulinum
Serotype F Toxin Gene And Generation Of Neutralizing Antibodies

The C. botulinum type F neurotoxin gene has been cloned and sequenced [East et al.
(1992) FEMS Microbiol. Lett. 96:225]. The nucleotide sequence of the toxin gene derived
from the 202F strain (ATCC 23387) is available from the EMBL/GenBank sequence data
banks under the accession number M92906; the nucleotide sequence of the coding region is
listed in SEQ ID NO:70. The amino acid sequence of the C. botulinum type F neurotoxin
derived from the 202F strain is listed in SEQ ID NO:71.

The DNA sequence encoding the native C. botulinum serotype F C fragment gene
derived from the 202F strain can be expressed using the pETHisb vector; the resulting coding
region is listed in SEQ ID NO:72 and the corresponding amino acid sequence is listed in SEQ
ID NO:73. The C fragment region from any strain of C. botulinum serotype F can be
amplified and expressed using the approach illustrated below using the C fragment derived
from C. botulinum type F 202F strain. Expression of the C fragment of C. botulinum type F
toxin in heterologous hosts (e.g., E. coli) has not been previously reported.

The C fragment of the C. botulinum serotype F (BotF) toxin gene is cloned using the
protocols and conditions described in Example 28 for the isolation of the native BotA gene.
The C. botulinum type F 202F strain is obtained from the American Type Culture Collection
(ATCC 23387). Alternatively, sequences encoding the BotF toxin may be isolated from any
BotF expressing strain [e.g., VPI 4404 (ATCC 25764), VPI 2382 (ATCC 27321) and
Langeland (ATCC 35415)].

The following primer pair is used to amplify the BotF gene:
5'-CG CCATGG CTATTCTAATTATATATTTTAATAG-3' [5' primer, engineered NcoI site underlined (SEQ
ID NO:74)] and 5'-GC AAGCTT TCA TTCTTTCCATCCATTCTC-3' [3' primer, engineered
HindIII site underlined, native gene termination codon italicized (SEQ ID NO:75)].
Following PCR amplification, the PCR product is inserted into the pCRscript vector and then
the 1.5 kb fragment is cloned into pETHisb vector as described for BotA C fragment gene in
Example 28. The resulting construct is termed pHisBotF.

pHisBotF expresses the BotF gene sequences under the transcriptional control of the
T7 lac promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag. The
pHisBotF expression construct is transformed into BL21(DE3) pLysS competent cells and 1
liter cultures are grown and induced, and his-tagged proteins are purified utilizing a NiNTA resin
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE
and detected by Coomassie staining and Western blot hybridization utilizing a
Ni-NTA-alkaline phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in
Example 31(c)(iii). This analysis permits the determination of expression levels of the
pHisBotF protein (i.e., number of mg/liter expressed as a soluble protein). The purified BotF
protein will migrate as a single band of the predicted MW (i.e., ~50 kD).

The level of expression of the pHisBotF protein may be modified (increased) by
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the lacIq gene on
the expression plasmid, and the plasmid expressed in BL21(DE3) cell lines in fermentation
cultures as described in Example 30. If only very low levels (i.e., less than about 0.5%) of
soluble pHisBotF protein are expressed using the above expression systems, the pHisBotF
construct may be co-expressed with pACYCGro construct as described in Example 32. In
this case, the recombinant BotF protein may co-purify with the folding chaperones. The
contaminating chaperones may be removed as described in Example 34. Preparations of
purified pHisBotF protein are tested for endotoxin contamination using the LAL assay as
described in Example 24.

The purified pHisBotF protein is used to generate neutralizing antibodies. BALBc
mice are immunized with the BotF protein using Gerbu GMDP adjuvant (CC Biotech) as
described in Example 36. The ability of the anti-BotF antibodies to neutralize native C.
botulinum type F toxin is demonstrated using the mouse-C. botulinum neutralization model
described in Example 36.

EXAMPLE 49 

Expression Of The C Fragment Of The C. botulinum
Serotype G Toxin Gene And Generation Of Neutralizing Antibodies

The C. botulinum type G neurotoxin gene has been cloned and sequenced [Campbell et
al. (1993) Biochimica et Biophysica Acta 1216:487 and Binz et al. (1990) Nucleic Acids Res.
18:5556]. The nucleotide sequence of the toxin gene derived from the 113/30 strain (NCFB
3012) is available from the EMBL/GenBank sequence data banks under the accession number
X74162; the nucleotide sequence of the coding region is listed in SEQ ID NO:76. The amino
acid sequence of the C. botulinum type G neurotoxin derived from this strain is listed in SEQ
ID NO:77.

The DNA sequence encoding the native C. botulinum serotype G C fragment gene
derived from the 113/30 strain can be expressed using the pETHisb vector; the resulting
coding region is listed in SEQ ID NO:78 and the corresponding amino acid sequence is listed
in SEQ ID NO:79. The C fragment region from any strain of C. botulinum serotype G can
be amplified and expressed using the approach illustrated below using the C fragment derived
from C. botulinum type G 113/30 strain. Expression of the C fragment of C. botulinum type
G toxin in heterologous hosts (e.g., E. coli) has not been previously reported.

The C fragment of the C. botulinum serotype G (BotG) toxin gene is cloned using the
protocols and conditions described in Example 28 for the isolation of the native BotA gene.
The C. botulinum type G 113/30 strain is obtained from the NCFB. The following primer
pair is used to amplify the BotG gene: 5'-CG CCATGG CTGACACAATTTTAATACAAGT-3'
[5' primer, engineered NcoI site underlined (SEQ ID NO:80)] and
5'-GC CTCGAG TTA TTCTGTCCATCCTTCATCCAC-3' [3' primer, engineered XhoI site
underlined, native gene termination codon italicized (SEQ ID NO:81)]. Following PCR
amplification, the PCR product is inserted into the pCRscript vector and then the 1.5 kb
fragment is cloned into pETHisb vector as described for BotA C fragment gene in Example
28 with the exception that the sequences encoding BotG are excised from the pCRscript
vector by digestion with NcoI and XhoI and the NcoI site is blunted (the BotG sequences
contain an internal HindIII site). This NcoI(filled)/XhoI fragment is then ligated to the
pETHisb vector which has been digested with NheI and SalI and the NheI site is blunted.
The resulting construct is termed pHisBotG.

pHisBotG expresses the BotG gene sequences under the transcriptional control of the
T7 lac promoter and the resulting protein contains an N-terminal 10XHis-tag affinity tag. The
pHisBotG expression construct is transformed into BL21(DE3) pLysS competent cells and 1
liter cultures are grown and induced, and his-tagged proteins are purified utilizing a NiNTA resin
as described in Example 28. Total, soluble and purified proteins are resolved by SDS-PAGE
and detected by Coomassie staining and Western blot hybridization utilizing a Ni-NTA-alkaline
phosphatase conjugate (Qiagen) which recognizes his-tagged proteins as described in
Example 31(c)(iii). This analysis permits the determination of expression levels of the
pHisBotG protein (i.e., number of mg/liter expressed as a soluble protein). The purified BotG
protein will migrate as a single band of the predicted MW (i.e., ~50 kD).

The level of expression of the pHisBotG protein may be modified (increased) by
substitution of the T7 promoter for the T7lac promoter, or by inclusion of the lacIq gene on
the expression plasmid, and the plasmid expressed in BL21(DE3) cell lines in fermentation
cultures as described in Example 30. If only very low levels (i.e., less than about 0.5%) of
soluble pHisBotG protein are expressed using the above expression systems, the pHisBotG
construct may be co-expressed with pACYCGro construct as described in Example 32. In
this case, the recombinant BotG protein may co-purify with the folding chaperones. The
contaminating chaperones may be removed as described in Example 34. Preparations of
purified pHisBotG protein are tested for endotoxin contamination using the LAL assay as
described in Example 24.

The purified pHisBotG protein is used to generate neutralizing antibodies. BALBc
mice are immunized with the BotG protein using Gerbu GMDP adjuvant (CC Biotech) as
described in Example 36. The ability of the anti-BotG antibodies to neutralize native C.
botulinum type G toxin is demonstrated using the mouse-C. botulinum neutralization model
described in Example 36.

EXAMPLE 50 

Expression Of Recombinant Botulinal Toxin Proteins In Eucaryotic Host Cells

Recombinant botulinal C fragment proteins may be expressed in eucaryotic host cells,
such as yeast and insect cells.

a) Expression In Yeast 

Botulinal C fragments derived from serotypes A, B, C, D, E, F and G may be
expressed in yeast cells using a variety of commercially available vectors. For example, the
pPIC3K and pPIC9K expression vectors (Invitrogen) may be employed for expression in the
methylotrophic yeast, Pichia pastoris. When the pPIC3K vector is employed, expression of
the botulinal C fragment protein will be intracellular. When the pPIC9K vector is employed,
the botulinal C fragment protein will be secreted (the alpha factor secretion signal is provided
on the pPIC9K vector).

DNA sequences encoding the desired C fragment are inserted into these vectors using
techniques known to the art. Briefly, the desired botulinal expression cassette (including
sequences encoding the his-tag; described in the preceding examples) is amplified using the
PCR in conjunction with primers that incorporate unique restriction sites at the termini of the
amplified fragment. Suitable restriction enzyme sites include SnaBI, EcoRI, AvrII and NotI.
When the botulinal C fragment is to be expressed using the pPIC3K vector, the initiator
methionine (ATG) is provided by the desired Bot gene sequence and a Kozak consensus
sequence is engineered upstream of the ATG (e.g., ACC ATG G).

The amplified restriction fragment containing the botulinal C fragment gene is then
cloned into the desired expression vector. Recombinant clones are integrated into the Pichia
pastoris genome and recombinant protein expression is induced using methanol following the
manufacturer's instructions (Invitrogen Pichia expression kit manual).

C. botulinum genes are A/T rich and contain multiple sequences that are similar to
yeast transcriptional termination signals (e.g., TTTTTATA). If premature transcription
termination is observed when the botulinal C fragment genes are expressed in yeast, the
transcription termination signals present in the C fragment genes can be removed by either
site directed mutagenesis (utilizing the pALTER system; Promega) or by construction of
synthetic genes utilizing overlapping synthetic primers.
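
As an illustration of how a C fragment coding sequence might be screened before mutagenesis or gene synthesis, the following minimal Python sketch reports the A/T fraction of a sequence and the positions of the example terminator-like motif TTTTTATA named above. The function names and the test sequence are hypothetical and are not part of the described method; only the motif itself is taken from the text.

```python
def at_fraction(seq: str) -> float:
    """Fraction of bases in a non-empty sequence that are A or T."""
    seq = seq.upper()
    return sum(base in "AT" for base in seq) / len(seq)


def find_motif(seq: str, motif: str = "TTTTTATA") -> list[int]:
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    seq, motif = seq.upper(), motif.upper()
    hits = []
    start = seq.find(motif)
    while start != -1:
        hits.append(start)
        start = seq.find(motif, start + 1)
    return hits


# Hypothetical test sequence, not taken from any botulinal gene.
example = "ATGAAATTTTTATAGGCAAATTTTTATACC"
print(find_motif(example))            # candidate sites to mutate or redesign
print(round(at_fraction(example), 2))  # overall A/T fraction
```

Each reported position marks a candidate site; whether a given site actually causes premature termination in yeast would still have to be determined experimentally.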

The botulinal C fragment genes may be expressed in other yeast cells using other
commercially available vectors [e.g., using the pYES2 vector (Invitrogen) and S. cerevisiae
cells (Invitrogen)].

b) Expression In Insect Cells

Botulinal C fragments derived from serotypes A, B, C, D, E, F and G may be
expressed in insect cells using a variety of commercially available vectors. For example, the
pBlueBac4 transfer vector (Invitrogen) may be employed for expression in Spodoptera
frugiperda (Sf9) insect cells (baculovirus expression system) (equivalent baculovirus vectors
and host cells are available from other vendors, e.g., Pharmingen, San Diego, CA). Botulinal
C fragments contained on NcoI/HindIII fragments contained within the pHisBotA-G
expression constructs (described in the preceding examples) are cloned into the pBlueBac4
vector (digested with NcoI and HindIII); the NcoI site present on the C fragment constructs
overlaps with the start codon of the fusion proteins. In the case of botulinal C fragment
clones that contain internal HindIII sites (e.g., using the BotG sequences described in Ex. 49),
the C fragment gene is contained within a NcoI/XhoI fragment on the pHisBot construct.
This NcoI/XhoI fragment is excised from pHisBot and inserted into pBlueBac4 digested with
NcoI and SalI. Recombinant baculoviruses are made and the desired recombinant C fragment
is expressed in Sf9 cells using the protocols provided by the manufacturer (Invitrogen MaxBac
manual). The resulting constructs will express the pHisBot protein intracellularly (including
the N-terminal his-tag) under the control of the polyhedrin promoter. For extracellular
secretion of botulinal C fragment proteins, the C fragment sequences from the pHisBot
constructs are cloned into the pMelBacB vector (Invitrogen) as described above for the
pBlueBac4 vector. When the pMelBacB vector is employed, the his-tagged botulinal C
fragment proteins are secreted (utilizing a vector-encoded honeybee melittin secretion signal)
and contain a nine amino acid extension at the N-terminus.
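
The choice between the two cloning routes described above can be sketched in Python as follows. This is an illustrative sketch only: the function names and test sequences are hypothetical, and the recognition sequences and cut positions used (HindIII A^AGCTT, XhoI C^TCGAG, SalI G^TCGAC) are the standard published ones rather than details stated in this specification. The sketch also shows why a XhoI-cut insert can be ligated into a SalI-cut vector: both enzymes leave the same 5' TCGA overhang.

```python
HINDIII = "AAGCTT"  # cuts A^AGCTT
XHOI = "CTCGAG"     # cuts C^TCGAG
SALI = "GTCGAC"     # cuts G^TCGAC


def five_prime_overhang(site: str, cut_after: int) -> str:
    """Overhang left by a palindromic site cut `cut_after` bases into each strand."""
    return site[cut_after:len(site) - cut_after]


def choose_strategy(coding_seq: str) -> tuple[str, str]:
    """Return (insert digest, vector digest) for a C fragment coding sequence.

    `coding_seq` is the region between the flanking cloning sites; an internal
    HindIII site rules out excising the insert with HindIII.
    """
    if HINDIII in coding_seq.upper():
        return ("NcoI/XhoI", "NcoI/SalI")
    return ("NcoI/HindIII", "NcoI/HindIII")


# Hypothetical coding sequences used only to exercise the function.
with_internal_site = "ATGGCTAAGCTTGGA"
without_internal_site = "ATGGCTGGATCA"

print(choose_strategy(with_internal_site))
print(choose_strategy(without_internal_site))
print(five_prime_overhang(XHOI, 1), five_prime_overhang(SALI, 1))
```

Note that a XhoI/SalI junction regenerates neither site, so the insert cannot later be excised with either enzyme at that end.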

His-tagged botulinal C fragments expressed in yeast or insect cells are purified using
metal chelation columns as described in the preceding examples.

From the above it is clear that the present invention provides compositions and
methods for the preparation of effective multivalent vaccines against C. botulinum neurotoxin.
It is also contemplated that the recombinant botulinal proteins be used for the production of
antitoxins. All publications and patents mentioned in the above specification are herein
incorporated by reference. Various modifications and variations of the described method and
system of the invention will be apparent to those skilled in the art without departing from the
scope and spirit of the invention.






SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Williams, James A.
Thalley, Bruce S.

(ii) TITLE OF INVENTION: Multivalent Vaccine For Clostridium
Botulinum Neurotoxin

(iii) NUMBER OF SEQUENCES: 82

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Medlen & Carroll
(B) STREET: 220 Montgomery Street, Suite 2200
(C) CITY: San Francisco
(D) STATE: California
(E) COUNTRY: United States of America
(F) ZIP: 94104

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.20

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Carroll, Peter G.
(B) REGISTRATION NUMBER: 32,837
(C) REFERENCE/DOCKET NUMBER: OPHD-02959

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (415) 705-8410
(B) TELEFAX: (415) 397-8338


(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGAAATTTAG CTGCAGCATC TGAC 24

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

TCTAGCAAAT TCGCTTGTGT TGAA 24

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

CTCGCATATA GCATTAGACC 20

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CTATCTAGGC CTAAAGTAT 19
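
The short sequences above can be compared against a longer sequence on either strand. The minimal Python sketch below, offered only as an illustration, takes SEQ ID NO:1 and SEQ ID NO:2 and searches two rows copied from SEQ ID NO:5 of this listing (the rows ending at positions 816 and 2016). The function names are hypothetical, and this portion of the listing does not itself state what role the short sequences play, so no such role is assumed here.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper- or lower-case DNA string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def locate(short: str, template: str):
    """Return ('+', index) or ('-', index) for the first match, else None.

    '+' means `short` occurs in `template` as written; '-' means its reverse
    complement does. Indices are 0-based within `template`.
    """
    short, template = short.upper(), template.upper()
    idx = template.find(short)
    if idx != -1:
        return ("+", idx)
    idx = template.find(reverse_complement(short))
    if idx != -1:
        return ("-", idx)
    return None


SEQ_ID_1 = "GGAAATTTAGCTGCAGCATCTGAC"
SEQ_ID_2 = "TCTAGCAAATTCGCTTGTGTTGAA"

# Rows of SEQ ID NO:5 as printed in the listing (spaces removed).
row_ending_816 = "GAGTTGTTAAATCGTGGAAATTTAGCTGCAGCATCTGACATAGTAAGA"
row_ending_2016 = "AAAGATGAATTCAACACAAGCGAATTTGCTAGATTAACTGTAGATTCA"

print(locate(SEQ_ID_1, row_ending_816))
print(locate(SEQ_ID_2, row_ending_2016))
```

The same check can be run for SEQ ID NO:3 and SEQ ID NO:4 against the corresponding rows of SEQ ID NO:5.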
(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..8130

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

ATG TCT TTA ATA TCT AAA GAA GAG TTA ATA AAA CTC GCA TAT AGC ATT 48
Met Ser Leu Ile Ser Lys Glu Glu Leu Ile Lys Leu Ala Tyr Ser Ile
1 5 10 15

AGA CCA AGA GAA AAT GAG TAT AAA ACT ATA CTA ACT AAT TTA GAC GAA 96
Arg Pro Arg Glu Asn Glu Tyr Lys Thr Ile Leu Thr Asn Leu Asp Glu
20 25 30

TAT AAT AAG TTA ACT ACA AAC AAT AAT GAA AAT AAA TAT TTG CAA TTA 144
Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gln Leu
35 40 45

AAA AAA CTA AAT GAA TCA ATT GAT GTT TTT ATG AAT AAA TAT AAA ACT 192
Lys Lys Leu Asn Glu Ser Ile Asp Val Phe Met Asn Lys Tyr Lys Thr
50 55 60

TCA AGC AGA AAT AGA GCA CTC TCT AAT CTA AAA AAA GAT ATA TTA AAA 240
Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp Ile Leu Lys
65 70 75 80

GAA GTA ATT CTT ATT AAA AAT TCC AAT ACA AGC CCT GTA GAA AAA AAT 288
Glu Val Ile Leu Ile Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn
85 90 95

TTA CAT TTT GTA TGG ATA GGT GGA GAA GTC AGT GAT ATT GCT CTT GAA 336
Leu His Phe Val Trp Ile Gly Gly Glu Val Ser Asp Ile Ala Leu Glu
100 105 110

TAC ATA AAA CAA TGG GCT GAT ATT AAT GCA GAA TAT AAT ATT AAA CTG 384
Tyr Ile Lys Gln Trp Ala Asp Ile Asn Ala Glu Tyr Asn Ile Lys Leu
115 120 125

5 TGG TAT GAT AGT GAA GCA TTC TTA GTA AAT ACA CTA AAA AAG GCT ATA 432 

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala He 
130 135 140 

GTT GAA TCT TCT ACC ACT GAA GCA TTA CAG CTA CTA GAG GAA GAG ATT 480 
10 Val Glu Ser Ser Thr Thr Glu Ala Leu Gin Leu Leu Glu Glu Glu He 
145 150 15S 160 

CAA AAT CCT CAA TTT GAT AAT ATG AAA TTT TAC AAA AAA AGG ATG GAA 528 
Gin Asn Pro Gin Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu 
15 165 170 175 

TTT ATA TAT GAT AG A CAA AAA AGG TTT ATA AAT TAT TAT AAA TCT CAA 576 
Phe He Tyr Asp Arg Gin Lys Arg Phe He Asn Tyr Tyr Lys Ser Gin 
180 185 190 

20 

ATC AAT AAA CCT ACA GTA CCT ACA ATA GAT GAT ATT ATA AAG TCT CAT 624 
He Asn Lys Pro Thr Val Pro Thr He Asp Asp He He Lys Ser His 
195 200 205 

25 CTA GTA TCT GAA TAT AAT AGA GAT GAA ACT GTA TTA GAA TCA TAT AGA 672 

Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arg 
210 215 220 

ACA AAT TCT TTG AGA AAA ATA AAT AGT AAT CAT GGG ATA GAT ATC AGG 720 
30 Thr Asn Ser Leu Arg Lys He Asn Ser Asn His Gly lie Asp He Arg 

225 230 235 240 

GCT AAT AGT TTG TTT ACA GAA CAA GAG TTA TTA AAT ATT TAT AGT CAG 76 8 

Ala Asn Ser Leu Phe Thr Glu Gin Glu Leu Leu Asn He Tyr Ser Gin 
35 245 250 255 

GAG TTG TTA AAT CGT GGA AAT TTA GCT GCA GCA TCT GAC ATA GTA AGA 816 

Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp He Val Arg 

260 265 270 

40 

TTA TTA GCC CTA AAA AAT TTT GGC GGA GTA TAT TTA GAT GTT GAT ATG 864 

Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met 

275 280 285 

45 CTT CCA GGT ATT CAC TCT GAT TTA TTT AAA ACA ATA TCT AGA CCT AGC 912 

Leu Pro Gly He His Ser Asp Leu Phe Lys Thr He Ser Arg Pro Ser 
290 295 ' 300 

TCT ATT GGA CTA GAC CGT TGG GAA ATG ATA AAA TTA GAG GCT ATT ATG 960 
50 ser He Gly Leu Asp Arg Trp Glu Met He Lys Leu Glu Ala He Met 

30S 310 315 320 

AAG TAT AAA AAA TAT ATA AAT AAT TAT ACA TCA GAA AAC TTT GAT AAA 1008 
Lys Tyr Lys Lys Tyr He Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lys 
55 325 330 335 

CTT GAT CAA CAA TTA AAA GAT AAT TTT AAA CTC ATT ATA GAA AGT AAA 1056 
Leu Asp Gin Gin Leu Lys Asp Asn Phe Lys Leu He He Glu Ser Lys 
340 345 350 



60 



AGT GAA AAA TCT GAG ATA TTT TCT AAA TTA GAA AAT TTA AAT GTA TCT 1104 
Ser Glu Lys Ser Glu He Phe Ser Lys Leu Glu Asn Leu Asn Val Ser 
355 360 365 



65 GAT CTT GAA ATT AAA ATA GCT TTC GCT TTA GGC AGT GTT ATA AAT CAA 1152 

Asp Leu Glu He Lys lie Ala Phe Ala Leu Gly Ser Val He Asn Gin 
370 375 380 
GCC TTG ATA TCA AAA CAA GGT TCA TAT CTT ACT AAC CTA GTA ATA GAA 1200
Ala Leu Ile Ser Lys Gln Gly Ser Tyr Leu Thr Asn Leu Val Ile Glu
385 390 395 400

CAA GTA AAA AAT AG A TAT CAA TTT TTA AAC CAA CAC CTT AAC CCA GCC 1248 
Gin Val Lys Asn Arg Tyr Gin Phe Leu Asn Gin His Leu Asn Pro Ala 
405 410 415 

ATA GAG TCT GAT AAT AAC TTC ACA GAT ACT ACT AAA ATT TTT CAT GAT 1296 
tie Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys lie Phe His Asp 
420 425 430 



TCA TTA TTT AAT TCA GCT ACC GCA GAA AAC TCT ATG TTT TTA ACA AAA 1344 
^ Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys 

1^ 435 440 445 



ATA GCA CCA TAG TTA CAA GTA GGT TTT ATG CCA GAA GCT CGC TCC ACA 13 92 

lie Ala Pro Tyr Leu Gin Val Gly Phe Met Pro Glu Ala Arg Ser Thr 
450 455 460 

ATA AGT TTA AGT GGT CCA GGA GCT TAT GCG TCA GCT TAC TAT GAT TTC 14 4 0 

He Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe 

465 470 475 480 

ATA AAT TTA CAA GAA AAT ACT ATA GAA AAA ACT TTA AAA GCA TCA GAT 14 88 

He Asn Leu Gin Glu Asn Thr He Glu Lys Thr Leu Lys Ala Ser Asp 
485 490 495 

TTA ATA GAA TTT AAA TTC CCA GAA AAT AAT CTA TCT CAA TTG ACA GAA 1536 

Leu He Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gin Leu Thr Glu 
500 505 510 



CAA GAA ATA AAT AGT CTA TGG AGC TTT GAT CAA GCA AGT GCA AAA TAT 1584 
Gin Glu He Asn Ser Leu Trp Ser Phe Asp Gin Ala Ser Ala Lys Tyr 
-'^ 515 520 525 



CAA TTT GAG AAA TAT GTA AGA GAT TAT ACT GGT GGA TCT CTT TCT GAA 1632 
Gin Phe Glu Lys Tyr Val Arg Asp Tyr Thr Glv Gly Ser Leu Ser Glu 
530 535 ' 540 

GAC AAT GGG GTA GAC TTT AAT AAA AAT ACT GCC CTC GAC AAA AAC TAT 16 80 

Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr 
545 550 555 560 

45 TTA TTA AAT AAT AAA ATT CCA TCA AAC AAT GTA GAA GAA GCT GGA AGT 1 728 

Leu Leu Asn Asn Lys He Pro Ser Asn Asn Val Glu Glu Ala Gly Ser 
565 570 575 



AAA AAT TAT GTT CAT TAT ATC ATA CAG TTA CAA GGA GAT GAT ATA AGT 1776 
Lys Asn Tyr Val His Tyr He He Gin Leu Gin Gly Asp Asp He Ser 
580 585 590 



TAT GAA GCA ACA TGC AAT TTA TTT TCT AAA AAT CCT AAA AAT AGT ATT 1824 
Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser He 
595 600 605 



ATT ATA CAA CGA AAT ATG AAT GAA AGT GCA AAA AGC TAC TTT TTA AGT 18 72 

He He Gin Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser 
610 615 620 

GAT GAT GGA GAA TCT ATT TTA GAA TTA AAT AAA TAT AGG ATA CCT GAA 192 0 

Asp Asp Gly Glu Ser He Leu Glu Leu Asn Lys Tyr Arg He Pro Glu 
625 630 635 640 

AGA TTA AAA AAT AAG GAA AAA GTA AAA GTA ACC TTT ATT GGA CAT GGT 1968 
Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe He Gly His Gly 
645 650 655 
AAA GAT GAA TTC AAC ACA AGC GAA TTT GCT AGA TTA ACT GTA GAT TCA 2016
Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser
660 665 670

5 CTT TCC AAT GAG ATA AGT TCA TTT TTA GAT ACC ATA AAA TTA GAT ATA 2 064 

Leu Ser Asn Glu lie Ser Ser Phe Leu Asp Thr lie Lys Leu Asp He 
675 680 685 

TCA CCT AAA AAT GTA GAA GTA AAC TTA CTT GGA TGT AAT ATG TTT AGT 2112 

10 Ser Pro Lys Asn Val Glu Val Asn Leu Leu Gly Cys Asn Met Phe Ser 

690 695 700 

TAT GAT TTT AAT GTT GAA GAA ACT TAT CCT GGG AAG TTG CTA TTA AGT 2160 

Tyr Acp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Ser 

15 705 710 715 720 

ATT ATG GAC AAA ATT ACT TCC ACT TTA CCT GAT GTA AAT AAA AAT TCT 2208 

He Met Asp Lys lie Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser 
725 730 735 

20 

ATT ACT ATA GGA GCA AAT CAA TAT GAA GTA AGA ATT AAT AGT GAG GGA 2256 

He Thr He Gly Ala Asn Gin Tyr Glu Val Arg He Asn Ser Glu Gly 
740 745 750 

25 AGA AAA GAA CTT CTG GCT CAC TCA GGT AAA TGG ATA AAT AAA GAA GAA 2304 

Arg Lys Glu Leu Leu Ala His Ser Gly Lys Trp He Asn Lys Glu Glu 
755 760 765 

GCT ATT ATG AGC GAT TTA TCT AGT AAA GAA TAC ATT TTT TTT GAT TCT 2 3 52 

30 Ala He Met Ser Asp Leu Ser Ser Lys Glu Tyr He Phe Phe Asp Ser 

770 775 780 

ATA GAT AAT AAG CTA AAA GCA AAG TCC AAG AAT ATT CCA GGA TTA GCA 24 00 

He Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn He Pro Gly Leu Ala 

35 785 790 795 800 

TCA ATA TCA GAA GAT ATA AAA ACA TTA TTA CTT GAT GCA AGT GTT AGT 244 8 

Ser He Ser Glu Asp He Lys Thr Leu Leu Leu Asp Ala Ser Val Ser 
805 810 815 

40 

CCT GAT ACA AAA TTT ATT TTA AAT AAT CTT AAG CTT AAT ATT GAA TCT 24 96 

Pro Asp Thr Lys Phe He Leu Asn Asn Leu Lys Leu Asn He Glu Ser 
820 825 830 

45 TCT ATT GGG GAT TAC ATT TAT TAT GAA AAA TTA GAG CCT GTT AAA AAT 2 544 

Ser He Gly Asp Tyr He Tyr Tyr Glu Lys Leu Glu Pro Val Lys Asn 
835 840 845 

ATA ATT CAC AAT TCT ATA GAT GAT TTA ATA GAT GAG TTC AAT CTA CTT 25 92 

50 He He His Asn Ser He Asp Asp Leu He Asp Glu Phe Asn Leu Leu 

850 855 860 

GAA AAT GTA TCT GAT GAA TTA TAT GAA TTA AAA AAA TTA AAT AAT CTA 264 0 

Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu 

55 865 870 875 880 

GAT GAG AAG TAT TTA ATA TCT TTT GAA GAT ATC TCA AAA AAT AAT TCA 26 88 

Asp Glu Lys Tyr Leu lie Ser Phe Glu Asp He Ser Lys Asn Asn Ser 
885 890 895 



60 



ACT TAC TCT GTA AGA TTT ATT AAC AAA AGT AAT GGT GAG TCA GTT TAT 2736 
Thr Tyr Ser Val Arg Phe He Asn Lys Ser Asn Gly Glu Ser Val Tyr 
900 905 910 



65 GTA GAA ACA GAA AAA GAA ATT TTT TCA AAA TAT AGC GAA CAT ATT ACA 2784 

Val Glu Thr Glu Lys Glu He Phe Ser Lys Tyr Ser Glu His He Thr 
915 920 925 
AAA GAA ATA AGT ACT ATA AAG AAT AGT ATA ATT ACA GAT GTT AAT GGT 2832
Lys Glu Ile Ser Thr Ile Lys Asn Ser Ile Ile Thr Asp Val Asn Gly
930 935 940

5 AAT TTA TTG GAT AAT ATA CAG TTA GAT CAT ACT TCT CAA GTT AAT ACA 2880 

Asn Leu Leu Asp Asn He Gin Leu Asp His Thr Ser Gin Val Asn Thr 
945 950 955 360 

TTA AAC GCA GCA TTC TTT ATT CAA TCA TTA ATA GAT TAT AGT AGC AAT 2 928 

10 Leu Asn Ala Ala Phe Phe He Gin Ser Leu He Asp Tyr Ser Ser Asn 

955 970 975 

AAA GAT GTA CTG AAT GAT TTA AGT ACC TCA GTT AAG GTT CAA CTT TAT 2976 

Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gin Leu Tyr 
15 980 985 990 

GCT CAA CTA TTT AGT ACA GGT TTA AAT ACT ATA TAT GAC TCT ATC CAA 3024 

Ala Gin Leu Phe Ser Thr Gly Leu Asn Thr He Tyr Asp Ser He Gin 
995 1000 1005 

20 

TTA GTA AAT TTA ATA TCA AAT GCA GTA AAT GAT ACT ATA AAT GTA CTA 3 072 

Leu Val Asn Leu He Ser Asn Ala Val Asn Asp Thr He Asn Val Leu 

1010 1015 1020 

25 CCT ACA ATA ACA GAG GGG ATA CCT ATT GTA TCT ACT ATA TTA GAC GGA 312 0 

Pro Thr He Thr Glu Gly He Pro He Val Ser Thr He Leu Asp Gly 
1025 1030 . 1035 1040 

ATA AAC TTA GGT GCA GCA ATT AAG GAA TTA CTA GAC GAA CAT GAC CCA 3168 

30 He Asn Leu Gly Ala Ala He Lys Glu Leu Leu Asp Glu His Asp Pro 

1045 1050 1055 

TTA CTA AAA AAA GAA TTA GAA GCT AAG GTG GGT GTT TTA GCA ATA AAT 3216 

Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala He Asn 
35 1060 1065 1070 

ATG TCA TTA TCT ATA GCT GCA ACT GTA GCT TCA ATT GTT GGA ATA GGT 3264 

Met Ser Leu Ser He Ala Ala Thr Val Ala Ser He Val Gly lie Gly 
1075 1080 1085 






GCT GAA GTT ACT ATT TTC TTA TTA CCT ATA GCT GGT ATA TCT GCA GGA 3 312 

Ala Glu Val Thr He Phe Leu Leu Pro He Ala Gly He Ser Ala Gly 
1090 1095 1100 



45 ATA CCT TCA TTA GTT AAT AAT GAA TTA ATA TTG CAT GAT AAG GCA ACT 3 360 

He Pro Ser Leu Val Asn Asn Glu Leu He Leu His Asp Lys Ala Thr 
1105 1110 1115 1120 

TCA GTG GTA AAC TAT TTT AAT CAT TTG TCT GAA TCT AAA AAA TAT GGC 3 4 08 

50 Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lys Lys Tyr Gly 

1125 1130 1135 

CCT CTT AAA ACA GAA GAT GAT AAA ATT TTA GTT CCT ATT GAT GAT TTA 3 4 56 

Pro Leu Lys Thr Glu Asp Asp Lys He Leu Val Pro He Asp Asp Leu 
55 1140 1145 1150 

GTA ATA TCA GAA ATA GAT TTT AAT AAT AAT TCG ATA AAA CTA GGA ACA 3 504 

Val He Ser Glu He Asp Phe Asn Asn Asn Ser He Lys Leu Gly Thr 
1155 1160 1165 



TGT AAT ATA TTA GCA ATG GAG GGG GGA TCA GGA CAC ACA GTG ACT GGT 3 552 

Cys Asn He Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Gly 
1170 1175 1180 



AAT ATA GAT CAC TTT TTC TCA TCT CCA TCT ATA AGT TCT CAT ATT CCT 3600
Asn Ile Asp His Phe Phe Ser Ser Pro Ser Ile Ser Ser His Ile Pro
1185 1190 1195 1200
TCA TTA TCA ATT TAT TCT GCA ATA GGT ATA GAA ACA GAA AAT CTA GAT 3648
Ser Leu Ser Ile Tyr Ser Ala Ile Gly Ile Glu Thr Glu Asn Leu Asp
1205 1210 1215

TTT TCA AAA AAA ATA ATG ATG TTA CCT AAT GCT CCT TCA AGA GTG TTT 3696
Phe Ser Lys Lys Ile Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe
1220 1225 1230

TGG TGG GAA ACT GGA GCA GTT CCA GGT TTA AGA TCA TTG GAA AAT GAC 3744
Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asp
1235 1240 1245

GGA ACT AGA TTA CTT GAT TCA ATA AGA GAT TTA TAC CCA GGT AAA TTT 3792
Gly Thr Arg Leu Leu Asp Ser Ile Arg Asp Leu Tyr Pro Gly Lys Phe
1250 1255 1260

TAC TGG AGA TTC TAT GCT TTT TTC GAT TAT GCA ATA ACT ACA TTA AAA 3840
Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala Ile Thr Thr Leu Lys
1265 1270 1275 1280

CCA GTT TAT GAA GAC ACT AAT ATT AAA ATT AAA CTA GAT AAA GAT ACT 3888
Pro Val Tyr Glu Asp Thr Asn Ile Lys Ile Lys Leu Asp Lys Asp Thr
1285 1290 1295

AGA AAC TTC ATA ATG CCA ACT ATA ACT ACT AAC GAA ATT AGA AAC AAA 3 93 6 

Arg Asn Phe He Met Pro Thr He Thr Thr Asn Glu He Arg Asn Lys 
1300 1305 1310 

TTA TCT TAT TCA TTT GAT GGA GCA GGA GGA ACT TAC TCT TTA TTA TTA 3 984 

Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu 
1315 1320 1325 

TCT TCA TAT CCA ATA TCA ACG AAT ATA AAT TTA TCT AAA GAT GAT TTA 4 032 

Ser Ser Tyr Pro He Ser Thr Asn He Asn Leu Ser Lys Asp Asp Leu 
1330 1335 1340 



TGG ATA TTT AAT ATT GAT AAT GAA GTA AGA GAA ATA TCT ATA GAA AAT 4080
Trp Ile Phe Asn Ile Asp Asn Glu Val Arg Glu Ile Ser Ile Glu Asn
1345 1350 1355 1360



GGT ACT ATT AAA AAA GGA AAG TTA ATA AAA GAT GTT TTA AGT AAA ATT 4128 
Gly Thr He Lys Lys Gly Lys Leu He Lys Asp Val Leu Ser Lys He 
1365 1370 1375 

GAT ATA AAT AAA AAT AAA CTT ATT ATA GGC AAT CAA ACA ATA GAT TTT 4176 
Asp He Asn Lys Asn Lys Leu lie He Gly Asn Gin Thr He Asp Phe 
1380 1385 1390 

TCA GGC GAT ATA GAT AAT AAA GAT AGA TAT ATA TTC TTG ACT TGT GAG 4224 
Ser Gly Asp He Asp Asn Lys Asp Arg Tyr He Phe Leu Thr Cys Glu 
1395 1400 1405 

TTA GAT GAT AAA ATT AGT TTA ATA ATA GAA ATA AAT CTT GTT GCA AAA 4272 
Leu Asp Asp Lys He Ser Leu He He Glu He Asn Leu Val Ala Lys 
1410 1415 1420 

TCT TAT AGT TTG TTA TTG TCT GGG GAT AAA AAT TAT TTG ATA TCC AAT 4320 
Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu He Ser Asn 
1425 1430 143S 1440 

TTA TCT AAT ACT ATT GAG AAA ATC AAT ACT TTA GGC CTA GAT AGT AAA 4 368 

Leu Ser Asn Thr He Glu Lys He Asn Thr Leu Gly Leu Asp Ser Lys 
1445 1450 1455 



wo 98/08540 PCT/US97/15394 



AAT ATA GCG TAG AAT TAG ACT GAT GAA TCT AAT AAT AAA TAT TTT GGA 4416 
Asn lie Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tyr Phe Gly 
1460 1465 1470 

5 GCT ATA TCT AAA ACA AGT CAA AAA AGC ATA ATA CAT TAT AAA AAA GAC 4464 

Ala He Ser Lys Thr Ser Gin Lys Ser lie He His Tyr Lys Lys Asp 
147S 1480 1485 

AGT AAA AAT ATA TTA GAA TTT TAT AAT GAC AGT ACA TTA GAA TTT AAC 4 512 

10 Ser Lys Asn He Leu Glu Phe Tyr Asn Asp Ser Thr Leu Glu Phe Asn 
1490 1495 1500 

AGT AAA GAT TTT ATT GCT GAA GAT ATA AAT GTA TTT ATG AAA GAT GAT 4 560 

Ser Lys Asp Phe He Ala Glu Asp He Asn Val Phe Met Lys Asp Asp 
15 1505 1510 1515 1520 

ATT AAT ACT ATA ACA GGA AAA TAC TAT GTT GAT AAT AAT ACT GAT AAA 46 08 

He Asn Thr He Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys 
1525 1530 1535 

20 

AGT ATA GAT TTC TCT ATT TCT TTA GTT AGT AAA AAT CAA GTA AAA GTA 46 56 

Ser He Asp Phe Ser He Ser Leu Val Ser Lys Asn Gin Val Lys Val 
1540 1545 1550 

25 AAT GGA TTA TAT TTA AAT GAA TCC GTA TAC TCA TCT TAC CTT GAT TTT 47 04 

Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe 
1555 1560 15G5 

GTG AAA AAT TCA GAT GGA CAC CAT AAT ACT TCT AAT TTT ATG AAT TTA 4 752 

30 Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu 

1570 1575 15B0 

TTT TTG GAC AAT ATA AGT TTC TGG AAA TTG TTT GGG TTT GAA AAT ATA 4800 
Phe I,eu Asp Asn He Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn He 
35 1585 1590 1595 1600 

AAT TTT GTA ATC GAT AAA TAC TTT ACC CTT GTT GGT AAA ACT AAT CTT 4 84 8 

Asn Phe Val He Asp Lys Tyr Phe Thr Leu Val Gly Lys Thr Asn Leu 
1605 1610 1615 

40 

GGA TAT GTA GAA TTT ATT TGT GAC AAT AAT AAA AAT ATA GAT ATA TAT 4 8 96 

Gly Tyr Val Glu Phe He Cys Asp Asn Asn Lys Asn He Asp He Tyr 

1620 ' 1625 1630 

45 TTT GGT GAA TGG AAA ACA TCG TCA TCT AAA AGC ACT ATA TTT AGC GGA 4 944 

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr He Phe Ser Gly 
1635 1640 1645 

AAT GGT AGA AAT GTT GTA GTA GAG CCT ATA TAT AAT CCT GAT ACG GGT 4 992 

50 Asn Gly Arg Asn Val Val Val Glu Pro He Tyr Asn Pro Asp Thr Gly 

1650 1655 1660 

GAA GAT ATA TCT ACT TCA CTA GAT TTT TCC TAT GAA CCT CTC TAT GGA 5040 
Glu Asp He Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tyr Gly 
55 1665 1670 1675 1680 

ATA GAT AGA TAT ATA AAT AAA GTA TTG ATA GCA CCT GAT TTA TAT ACA 5088 
He Asp Arg Tyr He Asn Lys Val Leu He Ala Pro Asp Leu Tyr Thr 
1685 1690 1695 



60 



AGT TTA ATA AAT ATT AAT ACC AAT TAT TAT TCA AAT GAG TAC TAC CCT 5136 
Ser Leu He Asn He Asn Thr Asn Tyr Tyr Ser Asn Giu Tyr Tyr Pro 
1700 L705 1710 



65 GAG ATT ATA GTT CTT AAC CCA AAT ACA TTC CAC AAA AAA GTA AAT ATA 5184 

Glu Ho He Val Leu Asn Pro Asn Thr Phe His Lys Lys Val Asn He 
1715 1720 1725 






AAT TTA GAT AGT TCT TCT TTT GAG TAT AAA TGG TCT ACA GAA GGA AGT 5232 
Asn Leu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Gly Ser 
1730 1735 1740 

5 GAC TTT ATT TTA GTT AGA TAG TTA GAA GAA AGT AAT AAA AAA ATA TTA 5280 

Asp Phe He Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys He Leu 
1745 1750 1755 1760 

CAA AAA ATA AGA ATC AAA GGT ATC TTA TCT AAT ACT CAA TCA TTT AAT 5328 
10 Gin Lys He Arg He Lys Gly He Leu Ser Asn Thr Gin Ser Phe Asn 

1765 X770 1775 



AAA ATG AGT ATA GAT TTT AAA GAT ATT AAA AAA CTA TCA TTA GGA TAT 53 76 

Lys Met Ser He Asp Phe Lys Asp He Lys Lys Leu Ser Leu Gly Tyr 
1780 1785 1790 

ATA ATG AGT AAT TTT AAA TCA TTT AAT TCT GAA AAT GAA TTA GAT AGA 54 24 

He Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arq 
1795 1800 1805 

GAT CAT TTA GGA TTT AAA ATA ATA GAT AAT AAA ACT TAT TAC TAT GAT 54 72 

Asp His Leu Gly Phe Lys He He Asp Asn Lys Thr Tyr Tyr T^t Asp 
1810 1815 1820 

GAA GAT AGT AAA TTA GTT AAA GGA TTA ATC AAT ATA AAT AAT TCA TTA 5 52 0 

Glu Asp Ser Lys Leu Val Lys Gly Leu He Asn He Asn Asn Ser Leu 
1825 1830 1835 1840 

TTC TAT TTT GAT CCT ATA GAA TTT AAC TTA GTA ACT GGA TGG CAA ACT 5568 
Phe Tyr Phe Asp Pro He Glu Phe Asn Leu Val Thr Gly Trp Gin Thr 
1845 1850 1855 



ATC AAT GGT AAA AAA TAT TAT TTT GAT ATA AAT ACT GGA GCA GCT TTA 5616 
He Asn Gly Lys Lys Tyr Tyr Phe Asp He Asn Thr Gly Ala Ala Leu 
I860 1865 1870 



ACT AGT TAT AAA ATT ATT AAT GGT AAA CAC TTT TAT TTT AAT A/iT GAT 56 64 

Thr Ser Tyr Lys He He Asn Gly Lvs His Phe Tyr Phe Asn Asn Asp 
1875 1880 1885 

GGT GTG ATG CAG TTG GGA GTA TTT AAA GGA CCT GAT GGA TTT GAA TAT 5712 
Gly Val Met Gin Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr 
1890 1895 1900 



TTT GCA CCT GCC AAT ACT CAA AAT AAT AAC ATA GAA GGT CAG GCT ATA 5760
Phe Ala Pro Ala Asn Thr Gln Asn Asn Asn Ile Glu Gly Gln Ala Ile
1905 1910 1915 1920

GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT 5808
Val Tyr Gln Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe
1925 1930 1935

GAT AAT AAC TCA AAA GCA GTC ACT GGA TGG AGA ATT ATT AAC AAT GAG 5856
Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Glu
1940 1945 1950

AAA TAT TAC TTT AAT CCT AAT AAT GCT ATT GCT GCA GTC GGA TTG CAA 5904
Lys Tyr Tyr Phe Asn Pro Asn Asn Ala Ile Ala Ala Val Gly Leu Gln
1955 1960 1965

GTA ATT GAC AAT AAT AAG TAT TAT TTC AAT CCT GAC ACT GCT ATC ATC 5952
Val Ile Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala Ile Ile
1970 1975 1980

TCA AAA GGT TGG CAG ACT GTT AAT GGT AGT AGA TAC TAC TTT GAT ACT 6000
Ser Lys Gly Trp Gln Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr
1985 1990 1995 2000
GAT ACC GCT ATT GCC TTT AAT GGT TAT AAA ACT ATT GAT GGT AAA CAC 6048
Asp Thr Ala Ile Ala Phe Asn Gly Tyr Lys Thr Ile Asp Gly Lys His
2005 2010 2015

5 TTT TAT TTT GAT AGT GAT TGT GTA GTG AAA ATA GGT GTG TTT AGT ACC 6096 

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys He Gly Val Phe Ser Thr 
2020 2025 2030 

TCT AAT GGA TTT GAA TAT TTT GCA CCT GCT AAT ACT TAT AAT AAT AAC 6144 
10 Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn 

2035 2040 2045 

ATA GAA GGT CAG GCT ATA GTT TAT CAA AGT AAA TTC TTA ACT TTG AAT 6192 
He Glu Gly Gin Ala He Val Tyr Gin Ser Lys Phe Leu Thr Leu Asn 
15 2050 2055 2060 

GGT AAA AAA TAT TAC TTT GAT AAT AAC TCA AAA GCA GTT ACC GGA TTG 624 0 

Giy Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu 
2065 2070 2075 2080 

20 

CAA ACT ATT GAT AGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GAA 628 B 

Gin Thr He Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu 
2085 2090 2095 

25 GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT 63 36 

Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys Lys Tyr Tyr Phe Asn 
2100 2105 ' 2110 

ACT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT GAT GGT AAA 6384 
M) Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr He Asp Gly Lys 

2115 2120 2125 

AAA TAT TAC TTT AAT ACT AAC ACT GCT ATA GCT TCA ACT GGT TAT ACA 64 32 

Lys Tyr Tyr Phe Asn Thr Asn Thr Ala He Ala Ser Thr Gly Tyr Thr 
35 21 30 2135 2140 

ATT ATT AAT GGT AAA CAT TTT TAT TTT AAT ACT GAT GGT ATT ATG CAG 64 8 0 

He He Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Giy He Met Gin 

2145 2150 2155 2160 

40 

ATA GGA GTG TTT AAA GGA CCT AAT GGA TTT GAA TAT TTT GCA CCT GCT 6 528 

He Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala 
2165 2170 2175 

45 ,^AT ACG GAT GCT AAC AAC ATA GAA GGT. CAA GCT ATA CTT TAC CAA AAT 6 576 

Asn Thr Asp Ala Asn Asn He Glu Gly Gin Ala He Leu Tyr Gin Asn 
2180 2185 2190 

GAA Tl'C TTA ACT TTG AAT GGT AAA AAA TAT TAC TTT GGT AGT GAC TCA 6624 
50 Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser 

2195 2200 2205 

AAA GCA GTT ACT GGA TGG AGA ATT ATT AAC AAT AAG AAA TAT TAC TTT 6672 
Lys Ala Val Thr Gly Trp Arg He He Asn Asn Lys Lys Tyr Tyr Phe 
55 2210 2215 2220 

AAT CCT AAT AAT GCT ATT GCT GCA ATT CAT CTA TGC ACT ATA AAT AAT 6 720 

Asn Pro Asn Asn Ala He Ala Ala He His Leu Cys Thr He Asn Asn 
2225 2230 2235 2240 

60 

GAC AAG TAT TAC TTT AGT TAT GAT GGA ATT CTT CAA AAT GGA TAT ATT 676 8 

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly He Leu Gin Asn Gly Tyr He 
2245 2250 2255 

65 ACT ATT GAA AGA AAT AAT TTC TAT TTT GAT GCT AAT AAT GAA TCT AAA 6816 

Thr He Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys 
2260 2265 2270 




ATG GTA ACA GGA GTA TTT AAA GGA CCT AAT GGA TTT GAG TAT TTT GCA 6864 

Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala 
2275 2280 2285 

CCT GCT AAT ACT CAC AAT AAT AAC ATA GAA GGT CAG GCT ATA GTT TAC 6 912 

Pro Ala Asn Thr His Asn Asn Asn lie Glu Gly Gin Ala lie Val Tyr 
2290 2295 2300 

CAG AAC AAA TTC TTA ACT TTG AAT GGC AAA AAA TAT TAT TTT GAT AAT 6960 

Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn 

2305 2310 2315 2320 



GAC TCA AAA GCA GTT ACT GGA TGG CAA ACC ATT GAT GGT AAA AAA TAT 7 008 

Asp Ser Lys Ala Val Thr Gly Trp Gin Thr lie Asp Gly Lys Lys Tyr 
2325 2330 2335 






TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT GGA TGG CAA ACT ATT 7056 
Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gin Thr lie 
2340 2345 2350 

GAT GGT AAA AAA TAT TAC TTT AAT CTT AAC ACT GCT GAA GCA GCT ACT 7104 
Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr 
2355 2360 2365 

GGA TGG CAA ACT ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT 7152 
Gly Trp Gin Thr lie Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr 
2370 2375 2380 

TTC ATA GCC TCA ACT GGT TAT ACA AGT ATT AAT GGT AAA CAT TTT TAT 7200 
Phe lie Ala Ser Thr Gly Tyr Thr Ser lie Asn Gly Lys His Phe Tyr 
2385 2390 2395 2400 



TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT AAT 724 8 

Phe Asn Thr Asp Gly lie Met Gin lie Gly Val Phe Lys Gly Pro Asn 
2405 2410 2415 



GGA TTT GAA TAC TTT GCA CCT GCT AAT ACG GAT GCT AAC AAC ATA GAA 72 96 

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn lie Glu 
2420 2425 2430 

GGT CPxA GCT ATA CTT TAC CAA AAT AAA TTC TTA ACT TTG AAT GGT AAA 7344 
Gly Gin Ala lie Leu Tyr Gin Asn Lys Phe Leu Thr Leu Asn Gly Lys 
2435 2440 2445 

4^ AAA TAT TAC TTT GGT AGT GAC TCA AAA GCA GTT ACC GGA CTG CGA ACT 73 92 

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr 
2450 2455. 2460 

ATT GAT GGT AAA AAA TAT TAC TTT AAT ACT AAC ACT GCT GTT GCA GTT 7440 
:>0 He Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val 

2465 2470 2475 2480 

ACT GGA TGG CAA ACT ATT AAT GGT AAA AAA TAC TAC TTT AAT ACT AAC 74 88 

Thr Gly Trp Gin Thr lie Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn 
2485 2490 2495 

ACT TCT ATA GCT TCA ACT GGT TAT ACA ATT ATT AGT GGT AAA CAT TTT 7536 
Thr Ser He Ala Ser Thr Gly Tyr Thr lie He Ser Gly Lys His Phe 
2500 2505 2510 

60 

TAT TTT AAT ACT GAT GGT ATT ATG CAG ATA GGA GTG TTT AAA GGA CCT 7584 
Tyr Phe Asn Thr Asp Gly lie Met Gin He Gly Val Phe Lys Gly Pro 
2515 2520 2525 

65 GAT GGA TTT GAA TAC TTT GCA CCT GCT AAT ACA GAT GCT AAC AAT ATA 7632 

Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn He 
2530 2535 2540 
GAA GGT CAA GCT ATA CGT TAT CAA AAT AGA TTC CTA TAT TTA CAT GAC 7680
Glu Gly Gln Ala Ile Arg Tyr Gln Asn Arg Phe Leu Tyr Leu His Asp
2545 2550 2555 2560

AAT ATA TAT TAT TTT GGT AAT AAT TCA AAA GCG GCT ACT GGT TGG GTA 7728
Asn Ile Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val
2565 2570 2575 

ACT ATT GAT GGT AAT AGA TAT TAC TTC GAG CCT AAT ACA GCT ATG GGT 7776
Thr Ile Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly
2580 2585 2590

GCG AAT GGT TAT AAA ACT ATT GAT AAT AAA AAT TTT TAC TTT AGA AAT 7824 
Ala Asn Gly Tyr Lys Thr Ile Asp Asn Lys Asn Phe Tyr Phe Arg Asn
2595 2600 2605

GGT TTA CCT CAG ATA GGA GTG TTT AAA GGG TCT AAT GGA TTT GAA TAC 7872 

Gly Leu Pro Gln Ile Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr
2610 2615 2620

TTT GCA CCT GCT AAT ACG GAT GCT AAC AAT ATA GAA GGT CAA GCT ATA 7920
Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu Gly Gln Ala Ile
2625 2630 2635 2640

CGT TAT CAA AAT AGA TTC CTA CAT TTA CTT GGA AAA ATA TAT TAC TTT 7968
Arg Tyr Gln Asn Arg Phe Leu His Leu Leu Gly Lys Ile Tyr Tyr Phe
2645 2650 2655

GGT AAT AAT TCA AAA GCA GTT ACT GGA TGG CAA ACT ATT AAT GGT AAA 8016 
Gly Asn Asn Ser Lys Ala Val Thr Gly Trp Gln Thr Ile Asn Gly Lys
2660 2665 2670

GTA TAT TAC TTT ATG CCT GAT ACT GCT ATG GCT GCA GCT GGT GGA CTT 8064 
Val Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu 
2675 2680 2685

TTC GAG ATT GAT GGT GTT ATA TAT TTC TTT GGT GTT GAT GGA GTA AAA 8112 
Phe Glu Ile Asp Gly Val Ile Tyr Phe Phe Gly Val Asp Gly Val Lys
2690 2695 2700 



GCC CCT GGG ATA TAT GGC TAA 8133
Ala Pro Gly Ile Tyr Gly
2705 2710 



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Met Ser Leu Ile Ser Lys Glu Glu Leu Ile Lys Leu Ala Tyr Ser Ile
1 5 10 15

Arg Pro Arg Glu Asn Glu Tyr Lys Thr Ile Leu Thr Asn Leu Asp Glu
20 25 30

Tyr Asn Lys Leu Thr Thr Asn Asn Asn Glu Asn Lys Tyr Leu Gln Leu
35 40 45

Lys Lys Leu Asn Glu Ser Ile Asp Val Phe Met Asn Lys Tyr Lys Thr
50 55 60

Ser Ser Arg Asn Arg Ala Leu Ser Asn Leu Lys Lys Asp Ile Leu Lys
65 70 75 80

Glu Val Ile Leu Ile Lys Asn Ser Asn Thr Ser Pro Val Glu Lys Asn
85 90 95

Leu His Phe Val Trp Ile Gly Gly Glu Val Ser Asp Ile Ala Leu Glu
100 105 110

Tyr Ile Lys Gln Trp Ala Asp Ile Asn Ala Glu Tyr Asn Ile Lys Leu
115 120 125

Trp Tyr Asp Ser Glu Ala Phe Leu Val Asn Thr Leu Lys Lys Ala Ile
130 135 140

Val Glu Ser Ser Thr Thr Glu Ala Leu Gln Leu Leu Glu Glu Glu Ile
145 150 155 160

Gln Asn Pro Gln Phe Asp Asn Met Lys Phe Tyr Lys Lys Arg Met Glu
165 170 175

Phe Ile Tyr Asp Arg Gln Lys Arg Phe Ile Asn Tyr Tyr Lys Ser Gln
180 185 190

Ile Asn Lys Pro Thr Val Pro Thr Ile Asp Asp Ile Ile Lys Ser His
195 200 205

Leu Val Ser Glu Tyr Asn Arg Asp Glu Thr Val Leu Glu Ser Tyr Arg
210 215 220

Thr Asn Ser Leu Arg Lys Ile Asn Ser Asn His Gly Ile Asp Ile Arg
225 230 235 240

Ala Asn Ser Leu Phe Thr Glu Gln Glu Leu Leu Asn Ile Tyr Ser Gln
245 250 255

Glu Leu Leu Asn Arg Gly Asn Leu Ala Ala Ala Ser Asp Ile Val Arg
260 265 270

Leu Leu Ala Leu Lys Asn Phe Gly Gly Val Tyr Leu Asp Val Asp Met
275 280 285

Leu Pro Gly Ile His Ser Asp Leu Phe Lys Thr Ile Ser Arg Pro Ser
290 295 300

Ser Ile Gly Leu Asp Arg Trp Glu Met Ile Lys Leu Glu Ala Ile Met
305 310 315 320

Lys Tyr Lys Lys Tyr Ile Asn Asn Tyr Thr Ser Glu Asn Phe Asp Lys
325 330 335

Leu Asp Gln Gln Leu Lys Asp Asn Phe Lys Leu Ile Ile Glu Ser Lys
340 345 350

Ser Glu Lys Ser Glu Ile Phe Ser Lys Leu Glu Asn Leu Asn Val Ser
355 360 365

Asp Leu Glu Ile Lys Ile Ala Phe Ala Leu Gly Ser Val Ile Asn Gln
370 375 380

Ala Leu Ile Ser Lys Gln Gly Ser Tyr Leu Thr Asn Leu Val Ile Glu
385 390 395 400

Gln Val Lys Asn Arg Tyr Gln Phe Leu Asn Gln His Leu Asn Pro Ala
405 410 415

Ile Glu Ser Asp Asn Asn Phe Thr Asp Thr Thr Lys Ile Phe His Asp
420 425 430

Ser Leu Phe Asn Ser Ala Thr Ala Glu Asn Ser Met Phe Leu Thr Lys
435 440 445

Ile Ala Pro Tyr Leu Gln Val Gly Phe Met Pro Glu Ala Arg Ser Thr
450 455 460

Ile Ser Leu Ser Gly Pro Gly Ala Tyr Ala Ser Ala Tyr Tyr Asp Phe
465 470 475 480

Ile Asn Leu Gln Glu Asn Thr Ile Glu Lys Thr Leu Lys Ala Ser Asp
485 490 495

Leu Ile Glu Phe Lys Phe Pro Glu Asn Asn Leu Ser Gln Leu Thr Glu
500 505 510

Gln Glu Ile Asn Ser Leu Trp Ser Phe Asp Gln Ala Ser Ala Lys Tyr
515 520 525

Gln Phe Glu Lys Tyr Val Arg Asp Tyr Thr Gly Gly Ser Leu Ser Glu
530 535 540

Asp Asn Gly Val Asp Phe Asn Lys Asn Thr Ala Leu Asp Lys Asn Tyr
545 550 555 560

Leu Leu Asn Asn Lys Ile Pro Ser Asn Asn Val Glu Glu Ala Gly Ser
565 570 575

Lys Asn Tyr Val His Tyr Ile Ile Gln Leu Gln Gly Asp Asp Ile Ser
580 585 590

Tyr Glu Ala Thr Cys Asn Leu Phe Ser Lys Asn Pro Lys Asn Ser Ile
595 600 605

Ile Ile Gln Arg Asn Met Asn Glu Ser Ala Lys Ser Tyr Phe Leu Ser
610 615 620

Asp Asp Gly Glu Ser Ile Leu Glu Leu Asn Lys Tyr Arg Ile Pro Glu
625 630 635 640

Arg Leu Lys Asn Lys Glu Lys Val Lys Val Thr Phe Ile Gly His Gly
645 650 655

Lys Asp Glu Phe Asn Thr Ser Glu Phe Ala Arg Leu Ser Val Asp Ser
660 665 670

Leu Ser Asn Glu Ile Ser Ser Phe Leu Asp Thr Ile Lys Leu Asp Ile
675 680 685

Ser Pro Lys Asn Val Glu Val Asn Leu Leu Gly Cys Asn Met Phe Ser
690 695 700

Tyr Asp Phe Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Ser
705 710 715 720

Ile Met Asp Lys Ile Thr Ser Thr Leu Pro Asp Val Asn Lys Asn Ser
725 730 735

Ile Thr Ile Gly Ala Asn Gln Tyr Glu Val Arg Ile Asn Ser Glu Gly
740 745 750

Arg Lys Glu Leu Leu Ala His Ser Gly Lys Trp Ile Asn Lys Glu Glu
755 760 765

Ala Ile Met Ser Asp Leu Ser Ser Lys Glu Tyr Ile Phe Phe Asp Ser
770 775 780

Ile Asp Asn Lys Leu Lys Ala Lys Ser Lys Asn Ile Pro Gly Leu Ala
785 790 795 800

Ser Ile Ser Glu Asp Ile Lys Thr Leu Leu Leu Asp Ala Ser Val Ser
805 810 815

Pro Asp Thr Lys Phe Ile Leu Asn Asn Leu Lys Leu Asn Ile Glu Ser
820 825 830

Ser Ile Gly Asp Tyr Ile Tyr Tyr Glu Lys Leu Glu Pro Val Lys Asn
835 840 845

Ile Ile His Asn Ser Ile Asp Asp Leu Ile Asp Glu Phe Asn Leu Leu
850 855 860

Glu Asn Val Ser Asp Glu Leu Tyr Glu Leu Lys Lys Leu Asn Asn Leu
865 870 875 880

Asp Glu Lys Tyr Leu Ile Ser Phe Glu Asp Ile Ser Lys Asn Asn Ser
885 890 895

Thr Tyr Ser Val Arg Phe Ile Asn Lys Ser Asn Gly Glu Ser Val Tyr
900 905 910

Val Glu Thr Glu Lys Glu Ile Phe Ser Lys Tyr Ser Glu His Ile Thr
915 920 925

Lys Glu Ile Ser Thr Ile Lys Asn Ser Ile Ile Thr Asp Val Asn Gly
930 935 940

Asn Leu Leu Asp Asn Ile Gln Leu Asp His Thr Ser Gln Val Asn Thr
945 950 955 960

Leu Asn Ala Ala Phe Phe Ile Gln Ser Leu Ile Asp Tyr Ser Ser Asn
965 970 975

Lys Asp Val Leu Asn Asp Leu Ser Thr Ser Val Lys Val Gln Leu Tyr
980 985 990

Ala Gln Leu Phe Ser Thr Gly Leu Asn Thr Ile Tyr Asp Ser Ile Gln
995 1000 1005

Leu Val Asn Leu Ile Ser Asn Ala Val Asn Asp Thr Ile Asn Val Leu
1010 1015 1020

Pro Thr Ile Thr Glu Gly Ile Pro Ile Val Ser Thr Ile Leu Asp Gly
1025 1030 1035 1040

Ile Asn Leu Gly Ala Ala Ile Lys Glu Leu Leu Asp Glu His Asp Pro
1045 1050 1055

Leu Leu Lys Lys Glu Leu Glu Ala Lys Val Gly Val Leu Ala Ile Asn
1060 1065 1070

Met Ser Leu Ser Ile Ala Ala Thr Val Ala Ser Ile Val Gly Ile Gly
1075 1080 1085

Ala Glu Val Thr Ile Phe Leu Leu Pro Ile Ala Gly Ile Ser Ala Gly
1090 1095 1100

Ile Pro Ser Leu Val Asn Asn Glu Leu Ile Leu His Asp Lys Ala Thr
1105 1110 1115 1120

Ser Val Val Asn Tyr Phe Asn His Leu Ser Glu Ser Lys Lys Tyr Gly
1125 1130 1135

Pro Leu Lys Thr Glu Asp Asp Lys Ile Leu Val Pro Ile Asp Asp Leu
1140 1145 1150

Val Ile Ser Glu Ile Asp Phe Asn Asn Asn Ser Ile Lys Leu Gly Thr
1155 1160 1165

Cys Asn Ile Leu Ala Met Glu Gly Gly Ser Gly His Thr Val Thr Gly
1170 1175 1180

Asn Ile Asp His Phe Phe Ser Ser Pro Ser Ile Ser Ser His Ile Pro
1185 1190 1195 1200

Ser Leu Ser Ile Tyr Ser Ala Ile Gly Ile Glu Thr Glu Asn Leu Asp
1205 1210 1215

Phe Ser Lys Lys Ile Met Met Leu Pro Asn Ala Pro Ser Arg Val Phe
1220 1225 1230

Trp Trp Glu Thr Gly Ala Val Pro Gly Leu Arg Ser Leu Glu Asn Asn
1235 1240 1245

Gly Thr Arg Leu Leu Asp Ser Ile Arg Asp Leu Tyr Pro Gly Lys Phe
1250 1255 1260

Tyr Trp Arg Phe Tyr Ala Phe Phe Asp Tyr Ala Ile Thr Thr Leu Lys
1265 1270 1275 1280

Pro Val Tyr Glu Asp Thr Asn Ile Lys Ile Lys Leu Asp Lys Asp Thr
1285 1290 1295

Arg Asn Phe Ile Met Pro Thr Ile Thr Thr Asn Glu Ile Arg Asn Lys
1300 1305 1310

Leu Ser Tyr Ser Phe Asp Gly Ala Gly Gly Thr Tyr Ser Leu Leu Leu
1315 1320 1325

Ser Ser Tyr Pro Ile Ser Thr Asn Ile Asn Leu Ser Lys Asp Asp Leu
1330 1335 1340

Trp Ile Phe Asn Ile Asp Asn Glu Val Arg Glu Ile Ser Ile Glu Asn
1345 1350 1355 1360

Gly Thr Ile Lys Lys Gly Lys Leu Ile Lys Asp Val Leu Ser Lys Ile
1365 1370 1375

Asp Ile Asn Lys Asn Lys Leu Ile Ile Gly Asn Gln Thr Ile Asp Phe
1380 1385 1390

Ser Gly Asp Ile Asp Asn Lys Asp Arg Tyr Ile Phe Leu Thr Cys Glu
1395 1400 1405

Leu Asp Asp Lys Ile Ser Leu Ile Ile Glu Ile Asn Leu Val Ala Lys
1410 1415 1420

Ser Tyr Ser Leu Leu Leu Ser Gly Asp Lys Asn Tyr Leu Ile Ser Asn
1425 1430 1435 1440

Leu Ser Asn Thr Ile Glu Lys Ile Asn Thr Leu Gly Leu Asp Ser Lys
1445 1450 1455

Asn Ile Ala Tyr Asn Tyr Thr Asp Glu Ser Asn Asn Lys Tyr Phe Gly
1460 1465 1470

Ala Ile Ser Lys Thr Ser Gln Lys Ser Ile Ile His Tyr Lys Lys Asp
1475 1480 1485

Ser Lys Asn Ile Leu Glu Phe Tyr Asn Asp Ser Thr Leu Glu Phe Asn
1490 1495 1500

Ser Lys Asp Phe Ile Ala Glu Asp Ile Asn Val Phe Met Lys Asp Asp
1505 1510 1515 1520

Ile Asn Thr Ile Thr Gly Lys Tyr Tyr Val Asp Asn Asn Thr Asp Lys
1525 1530 1535

Ser Ile Asp Phe Ser Ile Ser Leu Val Ser Lys Asn Gln Val Lys Val
1540 1545 1550

Asn Gly Leu Tyr Leu Asn Glu Ser Val Tyr Ser Ser Tyr Leu Asp Phe
1555 1560 1565

Val Lys Asn Ser Asp Gly His His Asn Thr Ser Asn Phe Met Asn Leu
1570 1575 1580

Phe Leu Asp Asn Ile Ser Phe Trp Lys Leu Phe Gly Phe Glu Asn Ile
1585 1590 1595 1600

Asn Phe Val Ile Asp Lys Tyr Phe Thr Leu Val Gly Lys Thr Asn Leu
1605 1610 1615

Gly Tyr Val Glu Phe Ile Cys Asp Asn Asn Lys Asn Ile Asp Ile Tyr
1620 1625 1630

Phe Gly Glu Trp Lys Thr Ser Ser Ser Lys Ser Thr Ile Phe Ser Gly
1635 1640 1645

Asn Gly Arg Asn Val Val Val Glu Pro Ile Tyr Asn Pro Asp Thr Gly
1650 1655 1660

Glu Asp Ile Ser Thr Ser Leu Asp Phe Ser Tyr Glu Pro Leu Tyr Gly
1665 1670 1675 1680

Ile Asp Arg Tyr Ile Asn Lys Val Leu Ile Ala Pro Asp Leu Tyr Thr
1685 1690 1695

Ser Leu Ile Asn Ile Asn Thr Asn Tyr Tyr Ser Asn Glu Tyr Tyr Pro
1700 1705 1710

Glu Ile Ile Val Leu Asn Pro Asn Thr Phe His Lys Lys Val Asn Ile
1715 1720 1725

Asn Leu Asp Ser Ser Ser Phe Glu Tyr Lys Trp Ser Thr Glu Gly Ser
1730 1735 1740

Asp Phe Ile Leu Val Arg Tyr Leu Glu Glu Ser Asn Lys Lys Ile Leu
1745 1750 1755 1760

Gln Lys Ile Arg Ile Lys Gly Ile Leu Ser Asn Thr Gln Ser Phe Asn
1765 1770 1775

Lys Met Ser Ile Asp Phe Lys Asp Ile Lys Lys Leu Ser Leu Gly Tyr
1780 1785 1790

Ile Met Ser Asn Phe Lys Ser Phe Asn Ser Glu Asn Glu Leu Asp Arg
1795 1800 1805

Asp His Leu Gly Phe Lys Ile Ile Asp Asn Lys Thr Tyr Tyr Tyr Asp
1810 1815 1820

Glu Asp Ser Lys Leu Val Lys Gly Leu Ile Asn Ile Asn Asn Ser Leu
1825 1830 1835 1840

Phe Tyr Phe Asp Pro Ile Glu Phe Asn Leu Val Thr Gly Trp Gln Thr
1845 1850 1855

Ile Asn Gly Lys Lys Tyr Tyr Phe Asp Ile Asn Thr Gly Ala Ala Leu
1860 1865 1870

Thr Ser Tyr Lys Ile Ile Asn Gly Lys His Phe Tyr Phe Asn Asn Asp
1875 1880 1885

Gly Val Met Gln Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr
1890 1895 1900

Phe Ala Pro Ala Asn Thr Gln Asn Asn Asn Ile Glu Gly Gln Ala Ile
1905 1910 1915 1920

Val Tyr Gln Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe
1925 1930 1935

Asp Asn Asn Ser Lys Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Glu
1940 1945 1950

Lys Tyr Tyr Phe Asn Pro Asn Asn Ala Ile Ala Ala Val Gly Leu Gln
1955 1960 1965

Val Ile Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala Ile Ile
1970 1975 1980

Ser Lys Gly Trp Gln Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr
1985 1990 1995 2000

Asp Thr Ala Ile Ala Phe Asn Gly Tyr Lys Thr Ile Asp Gly Lys His
2005 2010 2015

Phe Tyr Phe Asp Ser Asp Cys Val Val Lys Ile Gly Val Phe Ser Thr
2020 2025 2030

Ser Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn
2035 2040 2045

Ile Glu Gly Gln Ala Ile Val Tyr Gln Ser Lys Phe Leu Thr Leu Asn
2050 2055 2060

Gly Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu
2065 2070 2075 2080

Gln Thr Ile Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu
2085 2090 2095

Ala Ala Thr Gly Trp Gln Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn
2100 2105 2110

Thr Asn Thr Ala Glu Ala Ala Thr Gly Trp Gln Thr Ile Asp Gly Lys
2115 2120 2125

Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Ile Ala Ser Thr Gly Tyr Thr
2130 2135 2140

Ile Ile Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly Ile Met Gln
2145 2150 2155 2160

Ile Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala
2165 2170 2175

Asn Thr Asp Ala Asn Asn Ile Glu Gly Gln Ala Ile Leu Tyr Gln Asn
2180 2185 2190

Glu Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser
2195 2200 2205

Lys Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Lys Lys Tyr Tyr Phe
2210 2215 2220

Asn Pro Asn Asn Ala Ile Ala Ala Ile His Leu Cys Thr Ile Asn Asn
2225 2230 2235 2240

Asp Lys Tyr Tyr Phe Ser Tyr Asp Gly Ile Leu Gln Asn Gly Tyr Ile
2245 2250 2255

Thr Ile Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys
2260 2265 2270

Met Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala
2275 2280 2285






Pro Ala Asn Thr His Asn Asn Asn Ile Glu Gly Gln Ala Ile Val Tyr
2290 2295 2300

Gln Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn
2305 2310 2315 2320

Asp Ser Lys Ala Val Thr Gly Trp Gln Thr Ile Asp Gly Lys Lys Tyr
2325 2330 2335

Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gln Thr Ile
2340 2345 2350

Asp Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr
2355 2360 2365

Gly Trp Gln Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr
2370 2375 2380

Phe Ile Ala Ser Thr Gly Tyr Thr Ser Ile Asn Gly Lys His Phe Tyr
2385 2390 2395 2400

Phe Asn Thr Asp Gly Ile Met Gln Ile Gly Val Phe Lys Gly Pro Asn
2405 2410 2415

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu
2420 2425 2430

Gly Gln Ala Ile Leu Tyr Gln Asn Lys Phe Leu Thr Leu Asn Gly Lys
2435 2440 2445

Lys Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr
2450 2455 2460



Ile Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val
2465 2470 2475 2480

Thr Gly Trp Gln Thr Ile Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn
2485 2490 2495

Thr Ser Ile Ala Ser Thr Gly Tyr Thr Ile Ile Ser Gly Lys His Phe
2500 2505 2510

Tyr Phe Asn Thr Asp Gly Ile Met Gln Ile Gly Val Phe Lys Gly Pro
2515 2520 2525

Asp Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile
2530 2535 2540

Glu Gly Gln Ala Ile Arg Tyr Gln Asn Arg Phe Leu Tyr Leu His Asp
2545 2550 2555 2560

Asn Ile Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val
2565 2570 2575

Thr Ile Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly
2580 2585 2590

Ala Asn Gly Tyr Lys Thr Ile Asp Asn Lys Asn Phe Tyr Phe Arg Asn
2595 2600 2605

Gly Leu Pro Gln Ile Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr
2610 2615 2620

Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu Gly Gln Ala Ile
2625 2630 2635 2640

Arg Tyr Gln Asn Arg Phe Leu His Leu Leu Gly Lys Ile Tyr Tyr Phe
2645 2650 2655

Gly Asn Asn Ser Lys Ala Val Thr Gly Trp Gln Thr Ile Asn Gly Lys
2660 2665 2670

Val Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala Ala Gly Gly Leu
2675 2680 2685

Phe Glu Ile Asp Gly Val Ile Tyr Phe Phe Gly Val Asp Gly Val Lys
2690 2695 2700

Ala Pro Gly Ile Tyr Gly
2705 2710

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 811 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Ser Tyr Lys Ile Ile Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly
1 5 10 15

Val Met Gln Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe
20 25 30

Ala Pro Ala Asn Thr Gln Asn Asn Asn Ile Glu Gly Gln Ala Ile Val
35 40 45

Tyr Gln Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp
50 55 60

Asn Asn Ser Lys Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Glu Lys
65 70 75 80

Tyr Tyr Phe Asn Pro Asn Asn Ala Ile Ala Ala Val Gly Leu Gln Val
85 90 95

Ile Asp Asn Asn Lys Tyr Tyr Phe Asn Pro Asp Thr Ala Ile Ile Ser
100 105 110

Lys Gly Trp Gln Thr Val Asn Gly Ser Arg Tyr Tyr Phe Asp Thr Asp
115 120 125

Thr Ala Ile Ala Phe Asn Gly Tyr Lys Thr Ile Asp Gly Lys His Phe
130 135 140

Tyr Phe Asp Ser Asp Cys Val Val Lys Ile Gly Val Phe Ser Thr Ser
145 150 155 160

Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Tyr Asn Asn Asn Ile
165 170 175

Glu Gly Gln Ala Ile Val Tyr Gln Ser Lys Phe Leu Thr Leu Asn Gly
180 185 190

Lys Lys Tyr Tyr Phe Asp Asn Asn Ser Lys Ala Val Thr Gly Leu Gln
195 200 205

Thr Ile Asp Ser Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Glu Ala
210 215 220

Ala Thr Gly Trp Gln Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn Thr
225 230 235 240

Asn Thr Ala Glu Ala Ala Thr Gly Trp Gln Thr Ile Asp Gly Lys Lys
245 250 255

Tyr Tyr Phe Asn Thr Asn Thr Ala Ile Ala Ser Thr Gly Tyr Thr Ile
260 265 270

Ile Asn Gly Lys His Phe Tyr Phe Asn Thr Asp Gly Ile Met Gln Ile
275 280 285

Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro Ala Asn
290 295 300

Thr Asp Ala Asn Asn Ile Glu Gly Gln Ala Ile Leu Tyr Gln Asn Glu
305 310 315 320

Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Gly Ser Asp Ser Lys
325 330 335

Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Lys Lys Tyr Tyr Phe Asn
340 345 350

Pro Asn Asn Ala Ile Ala Ala Ile His Leu Cys Thr Ile Asn Asn Asp
355 360 365

Lys Tyr Tyr Phe Ser Tyr Asp Gly Ile Leu Gln Asn Gly Tyr Ile Thr
370 375 380

Ile Glu Arg Asn Asn Phe Tyr Phe Asp Ala Asn Asn Glu Ser Lys Met
385 390 395 400

Val Thr Gly Val Phe Lys Gly Pro Asn Gly Phe Glu Tyr Phe Ala Pro
405 410 415

Ala Asn Thr His Asn Asn Asn Ile Glu Gly Gln Ala Ile Val Tyr Gln
420 425 430

Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp Asn Asp
435 440 445

Ser Lys Ala Val Thr Gly Trp Gln Thr Ile Asp Gly Lys Lys Tyr Tyr
450 455 460

Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly Trp Gln Thr Ile Asp
465 470 475 480

Gly Lys Lys Tyr Tyr Phe Asn Leu Asn Thr Ala Glu Ala Ala Thr Gly
485 490 495

Trp Gln Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Phe
500 505 510

Ile Ala Ser Thr Gly Tyr Thr Ser Ile Asn Gly Lys His Phe Tyr Phe
515 520 525

Asn Thr Asp Gly Ile Met Gln Ile Gly Val Phe Lys Gly Pro Asn Gly
530 535 540

Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu Gly
545 550 555 560

Gln Ala Ile Leu Tyr Gln Asn Lys Phe Leu Thr Leu Asn Gly Lys Lys
565 570 575

Tyr Tyr Phe Gly Ser Asp Ser Lys Ala Val Thr Gly Leu Arg Thr Ile
580 585 590

Asp Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr Ala Val Ala Val Thr
595 600 605

Gly Trp Gln Thr Ile Asn Gly Lys Lys Tyr Tyr Phe Asn Thr Asn Thr
610 615 620

Ser Ile Ala Ser Thr Gly Tyr Thr Ile Ile Ser Gly Lys His Phe Tyr
625 630 635 640

Phe Asn Thr Asp Gly Ile Met Gln Ile Gly Val Phe Lys Gly Pro Asp
645 650 655

Gly Phe Glu Tyr Phe Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu
660 665 670

Gly Gln Ala Ile Arg Tyr Gln Asn Arg Phe Leu Tyr Leu His Asp Asn
675 680 685

Ile Tyr Tyr Phe Gly Asn Asn Ser Lys Ala Ala Thr Gly Trp Val Thr
690 695 700

Ile Asp Gly Asn Arg Tyr Tyr Phe Glu Pro Asn Thr Ala Met Gly Ala
705 710 715 720

Asn Gly Tyr Lys Thr Ile Asp Asn Lys Asn Phe Tyr Phe Arg Asn Gly
725 730 735

Leu Pro Gln Ile Gly Val Phe Lys Gly Ser Asn Gly Phe Glu Tyr Phe
740 745 750

Ala Pro Ala Asn Thr Asp Ala Asn Asn Ile Glu Gly Gln Ala Ile Arg
755 760 765

Tyr Gln Asn Arg Phe Leu His Leu Leu Gly Lys Ile Tyr Tyr Phe Gly
770 775 780

Asn Asn Ser Lys Ala Val Thr Gly Trp Gln Thr Ile Asn Gly Lys Val
785 790 795 800

Tyr Tyr Phe Met Pro Asp Thr Ala Met Ala Ala
805 810

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Ser Tyr Lys Ile Ile Asn Gly Lys His Phe Tyr Phe Asn Asn Asp Gly
1 5 10 15

Val Met Gln Leu Gly Val Phe Lys Gly Pro Asp Gly Phe Glu Tyr Phe
20 25 30

Ala Pro Ala Asn Thr Gln Asn Asn Asn Ile Glu Gly Gln Ala Ile Val
35 40 45

Tyr Gln Ser Lys Phe Leu Thr Leu Asn Gly Lys Lys Tyr Tyr Phe Asp
50 55 60

Asn Asn Ser Lys Ala Val Thr Gly Trp Arg Ile Ile Asn Asn Glu Lys
65 70 75 80

Tyr Tyr Phe Asn Pro Asn Asn Ala Ile Ala Ala
85 90

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7101 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..7098

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:



ATG AGT TTA GTT AAT AGA AAA CAG TTA GAA AAA ATG GCA AAT GTA AGA 48
Met Ser Leu Val Asn Arg Lys Gln Leu Glu Lys Met Ala Asn Val Arg
1 5 10 15

TTT CGT ACT CAA GAA GAT GAA TAT GTT GCA ATA TTG GAT GCT TTA GAA 96
Phe Arg Thr Gln Glu Asp Glu Tyr Val Ala Ile Leu Asp Ala Leu Glu
20 25 30

GAA TAT CAT AAT ATG TCA GAG AAT ACT GTA GTC GAA AAA TAT TTA AAA 144
Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lys
35 40 45

TTA AAA GAT ATA AAT AGT TTA ACA GAT ATT TAT ATA GAT ACA TAT AAA 192
Leu Lys Asp Ile Asn Ser Leu Thr Asp Ile Tyr Ile Asp Thr Tyr Lys
50 55 60

AAA TCT GGT AGA AAT AAA GCC TTA AAA AAA TTT AAG GAA TAT CTA GTT 240
Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val
65 70 75 80

ACA GAA GTA TTA GAG CTA AAG AAT AAT AAT TTA ACT CCA GTT GAG AAA 288
Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys
85 90 95

AAT TTA CAT TTT GTT TGG ATT GGA GGT CAA ATA AAT GAC ACT GCT ATT 336
Asn Leu His Phe Val Trp Ile Gly Gly Gln Ile Asn Asp Thr Ala Ile
100 105 110

AAT TAT ATA AAT CAA TGG AAA GAT GTA AAT AGT GAT TAT AAT GTT AAT 384
Asn Tyr Ile Asn Gln Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn
115 120 125

GTT TTT TAT GAT AGT AAT GCA TTT TTG ATA AAC ACA TTG AAA AAA ACT 432
Val Phe Tyr Asp Ser Asn Ala Phe Leu Ile Asn Thr Leu Lys Lys Thr
130 135 140

GTA GTA GAA TCA GCA ATA AAT GAT ACA CTT GAA TCA TTT AGA GAA AAC 480
Val Val Glu Ser Ala Ile Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn
145 150 155 160

TTA AAT GAC CCT AGA TTT GAC TAT AAT AAA TTC TTC AGA AAA CGT ATG 528
Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met
165 170 175

GAA ATA ATT TAT GAT AAA CAG AAA AAT TTC ATA AAC TAC TAT AAA GCT 576
Glu Ile Ile Tyr Asp Lys Gln Lys Asn Phe Ile Asn Tyr Tyr Lys Ala
180 185 190

CAA AGA GAA GAA AAT CCT GAA CTT ATA ATT GAT GAT ATT GTA AAG ACA 624
Gln Arg Glu Glu Asn Pro Glu Leu Ile Ile Asp Asp Ile Val Lys Thr
195 200 205

TTA TTA ATG TTT AAA GAA GGC AGT ATG AAT ATC CAT TTG ATA GAA GCT 1488
Leu Leu Met Phe Lys Glu Gly Ser Met Asn Ile His Leu Ile Glu Ala
485 490 495

GAT TTA AGA AAC TTT GAA ATC TCT AAA ACT AAT ATT TCT CAA TCA ACT 1536
Asp Leu Arg Asn Phe Glu Ile Ser Lys Thr Asn Ile Ser Gln Ser Thr
500 505 510

GAA CAA GAA ATG GCT AGC TTA TGG TCA TTT GAC GAT GCA AGA GCT AAA 1584
Glu Gln Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys
515 520 525

GCT CAA TTT GAA GAA TAT AAA AGG AAT TAT TTT GAA GGT TCT CTT GGT 1632
Ala Gln Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly
530 535 540

GAA GAT GAT AAT CTT GAT TTT TCT CAA AAT ATA GTA GTT GAC AAG GAG 1680
Glu Asp Asp Asn Leu Asp Phe Ser Gln Asn Ile Val Val Asp Lys Glu
545 550 555 560

TAT CTT TTA GAA AAA ATA TCT TCA TTA GCA AGA AGT TCA GAG AGA GGA 1728
Tyr Leu Leu Glu Lys Ile Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly
565 570 575

TAT ATA CAC TAT ATT GTT CAG TTA CAA GGA GAT AAA ATT AGT TAT GAA 1776
Tyr Ile His Tyr Ile Val Gln Leu Gln Gly Asp Lys Ile Ser Tyr Glu
580 585 590

GCA GCA TGT AAC TTA TTT GCA AAG ACT CCT TAT GAT AGT GTA CTG TTT 1824
Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe
595 600 605

CAG AAA AAT ATA GAA GAT TCA GAA ATT GCA TAT TAT TAT AAT CCT GGA 1872
Gln Lys Asn Ile Glu Asp Ser Glu Ile Ala Tyr Tyr Tyr Asn Pro Gly
610 615 620

GAT GGT GAA ATA CAA GAA ATA GAC AAG TAT AAA ATT CCA AGT ATA ATT 1920
Asp Gly Glu Ile Gln Glu Ile Asp Lys Tyr Lys Ile Pro Ser Ile Ile
625 630 635 640

TCT GAT AGA CCT AAG ATT AAA TTA ACA TTT ATT GGT CAT GGT AAA GAT 1968
Ser Asp Arg Pro Lys Ile Lys Leu Thr Phe Ile Gly His Gly Lys Asp
645 650 655



GAA TTT AAT ACT GAT ATA TTT GCA GGT TTT GAT GTA GAT TCA TTA TCC 2016
Glu Phe Asn Thr Asp Ile Phe Ala Gly Phe Asp Val Asp Ser Leu Ser
660 665 670

ACA GAA ATA GAA GCA GCA ATA GAT TTA GCT AAA GAG GAT ATT TCT CCT 2064
Thr Glu Ile Glu Ala Ala Ile Asp Leu Ala Lys Glu Asp Ile Ser Pro
675 680 685

AAG TCA ATA GAA ATA AAT TTA TTA GGA TGT AAT ATG TTT AGC TAC TCT 2112
Lys Ser Ile Glu Ile Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser
690 695 700

ATC AAC GTA GAG GAG ACT TAT CCT GGA AAA TTA TTA CTT AAA GTT AAA 2160
Ile Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys
705 710 715 720

GAT AAA ATA TCA GAA TTA ATG CCA TCT ATA AGT CAA GAC TCT ATT ATA 2208
Asp Lys Ile Ser Glu Leu Met Pro Ser Ile Ser Gln Asp Ser Ile Ile
725 730 735

GTA AGT GCA AAT CAA TAT GAA GTT AGA ATA AAT AGT GAA GGA AGA AGA 2256
Val Ser Ala Asn Gln Tyr Glu Val Arg Ile Asn Ser Glu Gly Arg Arg
740 745 750

GAA TTA TTG GAT CAT TCT GGT GAA TGG ATA AAT AAA GAA GAA AGT ATT 2304
Glu Leu Leu Asp His Ser Gly Glu Trp Ile Asn Lys Glu Glu Ser Ile
755 760 765

ATA AAG GAT ATT TCA TCA AAA GAA TAT ATA TCA TTT AAT CCT AAA GAA 2352
Ile Lys Asp Ile Ser Ser Lys Glu Tyr Ile Ser Phe Asn Pro Lys Glu
770 775 780

AAT AAA ATT ACA GTA AAA TCT AAA AAT TTA CCT GAG CTA TCT ACA TTA 2400
Asn Lys Ile Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu
785 790 795 800

TTA CAA GAA ATT AGA AAT AAT TCT AAT TCA AGT GAT ATT GAA CTA GAA 2448
Leu Gln Glu Ile Arg Asn Asn Ser Asn Ser Ser Asp Ile Glu Leu Glu
805 810 815

GAA AAA GTA ATG TTA ACA GAA TGT GAG ATA AAT GTT ATT TCA AAT ATA 2496
Glu Lys Val Met Leu Thr Glu Cys Glu Ile Asn Val Ile Ser Asn Ile
820 825 830

GAT ACG CAA ATT GTT GAG GAA AGG ATT GAA GAA GCT AAG AAT TTA ACT 2544
Asp Thr Gln Ile Val Glu Glu Arg Ile Glu Glu Ala Lys Asn Leu Thr
835 840 845

TCT GAC TCT ATT AAT TAT ATA AAA GAT GAA TTT AAA CTA ATA GAA TCT 2592
Ser Asp Ser Ile Asn Tyr Ile Lys Asp Glu Phe Lys Leu Ile Glu Ser
850 855 860

ATT TCT GAT GCA CTA TGT GAC TTA AAA CAA CAG AAT GAA TTA GAA GAT 2640
Ile Ser Asp Ala Leu Cys Asp Leu Lys Gln Gln Asn Glu Leu Glu Asp
865 870 875 880

TCT CAT TTT ATA TCT TTT GAG GAC ATA TCA GAG ACT GAT GAG GGA TTT 2688
Ser His Phe Ile Ser Phe Glu Asp Ile Ser Glu Thr Asp Glu Gly Phe
885 890 895

AGT ATA AGA TTT ATT AAT AAA GAA ACT GGA GAA TCT ATA TTT GTA GAA 2736
Ser Ile Arg Phe Ile Asn Lys Glu Thr Gly Glu Ser Ile Phe Val Glu
900 905 910

ACT GAA AAA ACA ATA TTC TCT GAA TAT GCT AAT CAT ATA ACT GAA GAG 2784
Thr Glu Lys Thr Ile Phe Ser Glu Tyr Ala Asn His Ile Thr Glu Glu
915 920 925

ATT TCT AAG ATA AAA GGT ACT ATA TTT GAT ACT GTA AAT GGT AAG TTA 2832
Ile Ser Lys Ile Lys Gly Thr Ile Phe Asp Thr Val Asn Gly Lys Leu
930 935 940

GTA AAA AAA GTA AAT TTA GAT ACT ACA CAC GAA GTA AAT ACT TTA AAT 2880
Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn
945 950 955 960

GCT GCA TTT TTT ATA CAA TCA TTA ATA GAA TAT AAT AGT TCT AAA GAA 2928
Ala Ala Phe Phe Ile Gln Ser Leu Ile Glu Tyr Asn Ser Ser Lys Glu
965 970 975

TCT CTT AGT AAT TTA AGT GTA GCA ATG AAA GTC CAA GTT TAC GCT CAA 2976
Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gln Val Tyr Ala Gln
980 985 990

TTA TTT AGT ACT GGT TTA AAT ACT ATT ACA GAT GCA GCC AAA GTT GTT 3024
Leu Phe Ser Thr Gly Leu Asn Thr Ile Thr Asp Ala Ala Lys Val Val
995 1000 1005

GAA TTA GTA TCA ACT GCA TTA GAT GAA ACT ATA GAC TTA CTT CCT ACA 3072
Glu Leu Val Ser Thr Ala Leu Asp Glu Thr Ile Asp Leu Leu Pro Thr
1010 1015 1020

TTA TCT GAA GGA TTA CCT ATA ATT GCA ACT ATT ATA GAT GGT GTA AGT 3120
Leu Ser Glu Gly Leu Pro Ile Ile Ala Thr Ile Ile Asp Gly Val Ser
1025 1030 1035 1040

TTA GGT GCA GCA ATC AAA GAG CTA AGT GAA ACG AGT GAC CCA TTA TTA 3168
Leu Gly Ala Ala Ile Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu
1045 1050 1055

AGA CAA GAA ATA GAA GCT AAG ATA GGT ATA ATG GCA GTA AAT TTA ACA 3216
Arg Gln Glu Ile Glu Ala Lys Ile Gly Ile Met Ala Val Asn Leu Thr
1060 1065 1070

ACA GCT ACA ACT GCA ATC ATT ACT TCA TCT TTG GGG ATA GCT AGT GGA 3264
Thr Ala Thr Thr Ala Ile Ile Thr Ser Ser Leu Gly Ile Ala Ser Gly
1075 1080 1085

TTT AGT ATA CTT TTA GTT CCT TTA GCA GGA ATT TCA GCA GGT ATA CCA 3312
Phe Ser Ile Leu Leu Val Pro Leu Ala Gly Ile Ser Ala Gly Ile Pro
1090 1095 1100

AGC TTA GTA AAC AAT GAA CTT GTA CTT CGA GAT AAG GCA ACA AAG GTT 3360
Ser Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val
1105 1110 1115 1120

GTA GAT TAT TTT AAA CAT GTT TCA TTA GTT GAA ACT GAA GGA GTA TTT 3408
Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe
1125 1130 1135

ACT TTA TTA GAT GAT AAA ATA ATG ATG CCA CAA GAT GAT TTA GTG ATA 3456
Thr Leu Leu Asp Asp Lys Ile Met Met Pro Gln Asp Asp Leu Val Ile
1140 1145 1150

TCA GAA ATA GAT TTT AAT AAT AAT TCA ATA GTT TTA GGT AAA TGT GAA 3504
Ser Glu Ile Asp Phe Asn Asn Asn Ser Ile Val Leu Gly Lys Cys Glu
1155 1160 1165

ATC TGG AGA ATG GAA GGT GGT TCA GGT CAT ACT GTA ACT GAT GAT ATA 3552
Ile Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp Ile
1170 1175 1180



GAT CAC TTC TTT TCA GCA CCA TCA ATA ACA TAT AGA GAG CCA CAC TTA 3600
Asp His Phe Phe Ser Ala Pro Ser Ile Thr Tyr Arg Glu Pro His Leu
1185 1190 1195 1200



TCT ATA TAT GAC GTA TTG GAA GTA CAA AAA GAA GAA CTT GAT TTG TCA 3648
Ser Ile Tyr Asp Val Leu Glu Val Gln Lys Glu Glu Leu Asp Leu Ser
1205 1210 1215

AAA GAT TTA ATG GTA TTA CCT AAT GCT CCA AAT AGA GTA TTT GCT TGG 3696
Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp
1220 1225 1230

GAA ACA GGA TGG ACA CCA GGT TTA AGA AGC TTA GAA AAT GAT GGC ACA 3744
Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Gly Thr
1235 1240 1245

AAA CTG TTA GAC CGT ATA AGA GAT AAC TAT GAA GGT GAG TTT TAT TGG 3792
Lys Leu Leu Asp Arg Ile Arg Asp Asn Tyr Glu Gly Glu Phe Tyr Trp
1250 1255 1260

AGA TAT TTT GCT TTT ATA GCT GAT GCT TTA ATA ACA ACA TTA AAA CCA 3840
Arg Tyr Phe Ala Phe Ile Ala Asp Ala Leu Ile Thr Thr Leu Lys Pro
1265 1270 1275 1280

AGA TAT GAA GAT ACT AAT ATA AGA ATA AAT TTA GAT AGT AAT ACT AGA 3888
Arg Tyr Glu Asp Thr Asn Ile Arg Ile Asn Leu Asp Ser Asn Thr Arg
1285 1290 1295

AGT TTT ATA GTT CCA ATA ATA ACT ACA GAA TAT ATA AGA GAA AAA TTA 3936
Ser Phe Ile Val Pro Ile Ile Thr Thr Glu Tyr Ile Arg Glu Lys Leu
1300 1305 1310

TCA TAT TCT TTC TAT GGT TCA GGA GGA ACT TAT GCA TTG TCT CTT TCT 3984
Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser
1315 1320 1325

CAA TAT AAT ATG GGT ATA AAT ATA GAA TTA AGT GAA AGT GAT GTT TGG 4032
Gln Tyr Asn Met Gly Ile Asn Ile Glu Leu Ser Glu Ser Asp Val Trp
1330 1335 1340



ATT ATA GAT GTT GAT AAT GTT GTG AGA GAT GTA ACT ATA GAA TCT GAT 4080
Ile Ile Asp Val Asp Asn Val Val Arg Asp Val Thr Ile Glu Ser Asp
1345 1350 1355 1360

AAA ATT AAA AAA GGT GAT TTA ATA GAA GGT ATT TTA TCT ACA CTA AGT 4128
Lys Ile Lys Lys Gly Asp Leu Ile Glu Gly Ile Leu Ser Thr Leu Ser
1365 1370 1375

ATT GAA GAG AAT AAA ATT ATC TTA AAT AGC CAT GAG ATT AAT TTT TCT 4176
Ile Glu Glu Asn Lys Ile Ile Leu Asn Ser His Glu Ile Asn Phe Ser
1380 1385 1390

GGT GAG GTA AAT GGA AGT AAT GGA TTT GTT TCT TTA ACA TTT TCA ATT 4224
Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser Ile
1395 1400 1405

TTA GAA GGA ATA AAT GCA ATT ATA GAA GTT GAT TTA TTA TCT AAA TCA 4272
Leu Glu Gly Ile Asn Ala Ile Ile Glu Val Asp Leu Leu Ser Lys Ser
1410 1415 1420

TAT AAA TTA CTT ATT TCT GGC GAA TTA AAA ATA TTG ATG TTA AAT TCA 4320
Tyr Lys Leu Leu Ile Ser Gly Glu Leu Lys Ile Leu Met Leu Asn Ser
1425 1430 1435 1440

AAT CAT ATT CAA CAG AAA ATA GAT TAT ATA GGA TTC AAT AGC GAA TTA 4368
Asn His Ile Gln Gln Lys Ile Asp Tyr Ile Gly Phe Asn Ser Glu Leu
1445 1450 1455

CAG AAA AAT ATA CCA TAT AGC TTT GTA GAT AGT GAA GGA AAA GAG AAT 4416
Gln Lys Asn Ile Pro Tyr Ser Phe Val Asp Ser Glu Gly Lys Glu Asn
1460 1465 1470



GGT TTr ATT AAT GGT TCA ACA AAA GAA GGT TTA TTT GTA TCT GAA TTA 446 4 

Gly Phe He Asn Gly Ser Thr Lys Glu Gly Leu Phe Val Ser Glu Leu 
1475 1480 1485 

CCT GAT GTA GTT CTT ATA AGT AAG GTT TAT ATG GAT GAT AGT AAG CCT 4512
Pro Asp Val Val Leu Ile Ser Lys Val Tyr Met Asp Asp Ser Lys Pro
1490 1495 1500

TCA TTT GGA TAT TAT AGT AAT AAT TTG AAA GAT GTC AAA GTT ATA ACT 4560
Ser Phe Gly Tyr Tyr Ser Asn Asn Leu Lys Asp Val Lys Val Ile Thr
1505 1510 1515 1520

AAA GAT AAT GTT AAT ATA TTA ACA GGT TAT TAT CTT AAG GAT GAT ATA 4608
Lys Asp Asn Val Asn Ile Leu Thr Gly Tyr Tyr Leu Lys Asp Asp Ile
1525 1530 1535

AAA ATC TCT CTT TCT TTG ACT CTA CAA GAT GAA AAA ACT ATA AAG TTA 4656
Lys Ile Ser Leu Ser Leu Thr Leu Gln Asp Glu Lys Thr Ile Lys Leu
1540 1545 1550

AAT AGT GTG CAT TTA GAT GAA AGT GGA GTA GCT GAG ATT TTG AAG TTC 4704
Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu Ile Leu Lys Phe
1555 1560 1565

ATG AAT AGA AAA GGT AAT ACA AAT ACT TCA GAT TCT TTA ATG AGC TTT 4752
Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe
1570 1575 1580

TTA GAA AGT ATG AAT ATA AAA AGT ATT TTC GTT AAT TTC TTA CAA TCT 4800
Leu Glu Ser Met Asn Ile Lys Ser Ile Phe Val Asn Phe Leu Gln Ser
1585 1590 1595 1600

AAT ATT AAG TTT ATA TTA GAT GCT AAT TTT ATA ATA AGT GGT ACT ACT 4848
Asn Ile Lys Phe Ile Leu Asp Ala Asn Phe Ile Ile Ser Gly Thr Thr
1605 1610 1615

TCT ATT GGC CAA TTT GAG TTT ATT TGT GAT GAA AAT GAT AAT ATA CAA 4896
Ser Ile Gly Gln Phe Glu Phe Ile Cys Asp Glu Asn Asp Asn Ile Gln
1620 1625 1630

CCA TAT TTC ATT AAG TTT AAT ACA CTA GAA ACT AAT TAT ACT TTA TAT 4944
Pro Tyr Phe Ile Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tyr
1635 1640 1645

GTA GGA AAT AGA CAA AAT ATG ATA GTG GAA CCA AAT TAT GAT TTA GAT 4992
Val Gly Asn Arg Gln Asn Met Ile Val Glu Pro Asn Tyr Asp Leu Asp
1650 1655 1660

GAT TCT GGA GAT ATA TCT TCA ACT GTT ATC AAT TTC TCT CAA AAG TAT 5040
Asp Ser Gly Asp Ile Ser Ser Thr Val Ile Asn Phe Ser Gln Lys Tyr
1665 1670 1675 1680

CTT TAT GGA ATA GAC AGT TGT GTT AAT AAA GTT GTA ATT TCA CCA AAT 5088
Leu Tyr Gly Ile Asp Ser Cys Val Asn Lys Val Val Ile Ser Pro Asn
1685 1690 1695

ATT TAT ACA GAT GAA ATA AAT ATA ACG CCT GTA TAT GAA ACA AAT AAT 5136
Ile Tyr Thr Asp Glu Ile Asn Ile Thr Pro Val Tyr Glu Thr Asn Asn
1700 1705 1710

ACT TAT CCA GAA GTT ATT GTA TTA GAT GCA AAT TAT ATA AAT GAA AAA 5184
Thr Tyr Pro Glu Val Ile Val Leu Asp Ala Asn Tyr Ile Asn Glu Lys
1715 1720 1725

ATA AAT GTT AAT ATC AAT GAT CTA TCT ATA CGA TAT GTA TGG AGT AAT 5232
Ile Asn Val Asn Ile Asn Asp Leu Ser Ile Arg Tyr Val Trp Ser Asn
1730 1735 1740

GAT GGT AAT GAT TTT ATT CTT ATG TCA ACT AGT GAA GAA AAT AAG GTG 5280
Asp Gly Asn Asp Phe Ile Leu Met Ser Thr Ser Glu Glu Asn Lys Val
1745 1750 1755 1760

TCA CAA GTT AAA ATA AGA TTC GTT AAT GTT TTT AAA GAT AAG ACT TTG 5328
Ser Gln Val Lys Ile Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu
1765 1770 1775

GCA AAT AAG CTA TCT TTT AAC TTT AGT GAT AAA CAA GAT GTA CCT GTA 5376
Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gln Asp Val Pro Val
1780 1785 1790

AGT GAA ATA ATC TTA TCA TTT ACA CCT TCA TAT TAT GAG GAT GGA TTG 5424
Ser Glu Ile Ile Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu
1795 1800 1805

ATT GGC TAT GAT TTG GGT CTA GTT TCT TTA TAT AAT GAG AAA TTT TAT 5472
Ile Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lys Phe Tyr
1810 1815 1820
ATT AAT AAC TTT GGA ATG ATG GTA TCT GGA TTA ATA TAT ATT AAT GAT 5520
Ile Asn Asn Phe Gly Met Met Val Ser Gly Leu Ile Tyr Ile Asn Asp
1825 1830 1835 1840

TCA TTA TAT TAT TTT AAA CCA CCA GTA AAT AAT TTG ATA ACT GGA TTT 5568
Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu Ile Thr Gly Phe
1845 1850 1855

GTG ACT GTA GGC GAT GAT AAA TAC TAC TTT AAT CCA ATT AAT GGT GGA 5616
Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro Ile Asn Gly Gly
1860 1865 1870

GCT GCT TCA ATT GGA GAG ACA ATA ATT GAT GAC AAA AAT TAT TAT TTC 5664
Ala Ala Ser Ile Gly Glu Thr Ile Ile Asp Asp Lys Asn Tyr Tyr Phe
1875 1880 1885

AAC CAA AGT GGA GTG TTA CAA ACA GGT GTA TTT AGT ACA GAA GAT GGA 5712
Asn Gln Ser Gly Val Leu Gln Thr Gly Val Phe Ser Thr Glu Asp Gly
1890 1895 1900

TTT AAA TAT TTT GCC CCA GCT AAT ACA CTT GAT GAA AAC CTA GAA GGA 5760
Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp Glu Asn Leu Glu Gly
1905 1910 1915 1920

GAA GCA ATT GAT TTT ACT GGA AAA TTA ATT ATT GAC GAA AAT ATT TAT 5808
Glu Ala Ile Asp Phe Thr Gly Lys Leu Ile Ile Asp Glu Asn Ile Tyr
1925 1930 1935

TAT TTT GAT GAT AAT TAT AGA GGA GCT GTA GAA TGG AAA GAA TTA GAT 5856
Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu Trp Lys Glu Leu Asp
1940 1945 1950

GGT GAA ATG CAC TAT TTT AGC CCA GAA ACA GGT AAA GCT TTT AAA GGT 5904
Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lys Gly
1955 1960 1965

CTA AAT CAA ATA GGT GAT TAT AAA TAC TAT TTC AAT TCT GAT GGA GTT 5952
Leu Asn Gln Ile Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val
1970 1975 1980

ATG CAA AAA GGA TTT GTT AGT ATA AAT GAT AAT AAA CAC TAT TTT GAT 6000
Met Gln Lys Gly Phe Val Ser Ile Asn Asp Asn Lys His Tyr Phe Asp
1985 1990 1995 2000

GAT TCT GGT GTT ATG AAA GTA GGT TAC ACT GAA ATA GAT GGC AAG CAT 6048
Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu Ile Asp Gly Lys His
2005 2010 2015

TTC TAC TTT GCT GAA AAC GGA GAA ATG CAA ATA GGA GTA TTT AAT ACA 6096
Phe Tyr Phe Ala Glu Asn Gly Glu Met Gln Ile Gly Val Phe Asn Thr
2020 2025 2030

GAA GAT GGA TTT AAA TAT TTT GCT CAT CAT AAT GAA GAT TTA GGA AAT 6144
Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Gly Asn
2035 2040 2045

GAA GAA GGT GAA GAA ATC TCA TAT TCT GGT ATA TTA AAT TTC AAT AAT 6192
Glu Glu Gly Glu Glu Ile Ser Tyr Ser Gly Ile Leu Asn Phe Asn Asn
2050 2055 2060



AAA ATT TAC TAT TTT GAT GAT TCA TTT ACA GCT GTA GTT GGA TGG AAA 6240
Lys Ile Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Trp Lys
2065 2070 2075 2080

GAT TTA GAG GAT GGT TCA AAG TAT TAT TTT GAT GAA GAT ACA GCA GAA 6288
Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu
2085 2090 2095
GCA TAT ATA GGT TTG TCA TTA ATA AAT GAT GGT CAA TAT TAT TTT AAT 6336
Ala Tyr Ile Gly Leu Ser Leu Ile Asn Asp Gly Gln Tyr Tyr Phe Asn
2100 2105 2110

GAT GAT GGA ATT ATG CAA GTT GGA TTT GTC ACT ATA AAT GAT AAA GTC 6384
Asp Asp Gly Ile Met Gln Val Gly Phe Val Thr Ile Asn Asp Lys Val
2115 2120 2125

TTC TAC TTC TCT GAC TCT GGA ATT ATA GAA TCT GGA GTA CAA AAC ATA 6432
Phe Tyr Phe Ser Asp Ser Gly Ile Ile Glu Ser Gly Val Gln Asn Ile
2130 2135 2140

GAT GAC AAT TAT TTC TAT ATA GAT GAT AAT GGT ATA GTT CAA ATT GGT 6480
Asp Asp Asn Tyr Phe Tyr Ile Asp Asp Asn Gly Ile Val Gln Ile Gly
2145 2150 2155 2160

GTA TTT GAT ACT TCA GAT GGA TAT AAA TAT TTT GCA CCT GCT AAT ACT 6528
Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr
2165 2170 2175

GTA AAT GAT AAT ATT TAC GGA CAA GCA GTT GAA TAT AGT GGT TTA GTT 6576
Val Asn Asp Asn Ile Tyr Gly Gln Ala Val Glu Tyr Ser Gly Leu Val
2180 2185 2190

AGA GTT GGG GAA GAT GTA TAT TAT TTT GGA GAA ACA TAT ACA ATT GAG 6624
Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr Ile Glu
2195 2200 2205

ACT GGA TGG ATA TAT GAT ATG GAA AAT GAA AGT GAT AAA TAT TAT TTC 6672
Thr Gly Trp Ile Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tyr Phe
2210 2215 2220

AAT CCA GAA ACT AAA AAA GCA TGC AAA GGT ATT AAT TTA ATT GAT GAT 6720
Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly Ile Asn Leu Ile Asp Asp
2225 2230 2235 2240

ATA AAA TAT TAT TTT GAT GAG AAG GGC ATA ATG AGA ACG GGT CTT ATA 6768
Ile Lys Tyr Tyr Phe Asp Glu Lys Gly Ile Met Arg Thr Gly Leu Ile
2245 2250 2255

TCA TTT GAA AAT AAT AAT TAT TAC TTT AAT GAG AAT GGT GAA ATG CAA 6816
Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gln
2260 2265 2270

TTT GGT TAT ATA AAT ATA GAA GAT AAG ATG TTC TAT TTT GGT GAA GAT 6864
Phe Gly Tyr Ile Asn Ile Glu Asp Lys Met Phe Tyr Phe Gly Glu Asp
2275 2280 2285

GGT GTC ATG CAG ATT GGA GTA TTT AAT ACA CCA GAT GGA TTT AAA TAC 6912
Gly Val Met Gln Ile Gly Val Phe Asn Thr Pro Asp Gly Phe Lys Tyr
2290 2295 2300

TTT GCA CAT CAA AAT ACT TTG GAT GAG AAT TTT GAG GGA GAA TCA ATA 6960
Phe Ala His Gln Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser Ile
2305 2310 2315 2320

AAC TAT ACT GGT TGG TTA GAT TTA GAT GAA AAG AGA TAT TAT TTT ACA 7008
Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tyr Tyr Phe Thr
2325 2330 2335



GAT GAA TAT ATT GCA GCA ACT GGT TCA GTT ATT ATT GAT GGT GAG GAG 7056
Asp Glu Tyr Ile Ala Ala Thr Gly Ser Val Ile Ile Asp Gly Glu Glu
2340 2345 2350

TAT TAT TTT GAT CCT GAT ACA GCT CAA TTA GTG ATT AGT GAA 7098
Tyr Tyr Phe Asp Pro Asp Thr Ala Gln Leu Val Ile Ser Glu
2355 2360 2365

TAG 7101
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2366 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Ser Leu Val Asn Arg Lys Gln Leu Glu Lys Met Ala Asn Val Arg
1 5 10 15

Phe Arg Thr Gln Glu Asp Glu Tyr Val Ala Ile Leu Asp Ala Leu Glu
20 25 30



Glu Tyr His Asn Met Ser Glu Asn Thr Val Val Glu Lys Tyr Leu Lys
35 40 45

Leu Lys Asp Ile Asn Ser Leu Thr Asp Ile Tyr Ile Asp Thr Tyr Lys
50 55 60

Lys Ser Gly Arg Asn Lys Ala Leu Lys Lys Phe Lys Glu Tyr Leu Val
65 70 75 80

Thr Glu Val Leu Glu Leu Lys Asn Asn Asn Leu Thr Pro Val Glu Lys
85 90 95

Asn Leu His Phe Val Trp Ile Gly Gly Gln Ile Asn Asp Thr Ala Ile
100 105 110



Asn Tyr Ile Asn Gln Trp Lys Asp Val Asn Ser Asp Tyr Asn Val Asn
115 120 125

Val Phe Tyr Asp Ser Asn Ala Phe Leu Ile Asn Thr Leu Lys Lys Thr
130 135 140

Val Val Glu Ser Ala Ile Asn Asp Thr Leu Glu Ser Phe Arg Glu Asn
145 150 155 160

Leu Asn Asp Pro Arg Phe Asp Tyr Asn Lys Phe Phe Arg Lys Arg Met
165 170 175

Glu Ile Ile Tyr Asp Lys Gln Lys Asn Phe Ile Asn Tyr Tyr Lys Ala
180 185 190

Gln Arg Glu Glu Asn Pro Glu Leu Ile Ile Asp Asp Ile Val Lys Thr
195 200 205

Tyr Leu Ser Asn Glu Tyr Ser Lys Glu Ile Asp Glu Leu Asn Thr Tyr
210 215 220

Ile Glu Glu Ser Leu Asn Lys Ile Thr Gln Asn Ser Gly Asn Asp Val
225 230 235 240

Arg Asn Phe Glu Glu Phe Lys Asn Gly Glu Ser Phe Asn Leu Tyr Glu
245 250 255

Gln Glu Leu Val Glu Arg Trp Asn Leu Ala Ala Ala Ser Asp Ile Leu
260 265 270

Arg Ile Ser Ala Leu Lys Glu Ile Gly Gly Met Tyr Leu Asp Val Asp
275 280 285

Met Leu Pro Gly Ile Gln Pro Asp Leu Phe Glu Ser Ile Glu Lys Pro
290 295 300

Ser Ser Val Thr Val Asp Phe Trp Glu Met Thr Lys Leu Glu Ala Ile
305 310 315 320
Met Lys Tyr Lys Glu Tyr Ile Pro Glu Tyr Thr Ser Glu His Phe Asp
325 330 335

Met Leu Asp Glu Glu Val Gln Ser Ser Phe Glu Ser Val Leu Ala Ser
340 345 350

Lys Ser Asp Lys Ser Glu Ile Phe Ser Ser Leu Gly Asp Met Glu Ala
355 360 365

Ser Pro Leu Glu Val Lys Ile Ala Phe Asn Ser Lys Gly Ile Ile Asn
370 375 380

Gln Gly Leu Ile Ser Val Lys Asp Ser Tyr Cys Ser Asn Leu Ile Val
385 390 395 400

Lys Gln Ile Glu Asn Arg Tyr Lys Ile Leu Asn Asn Ser Leu Asn Pro
405 410 415

Ala Ile Ser Glu Asp Asn Asp Phe Asn Thr Thr Thr Asn Thr Phe Ile
420 425 430

Asp Ser Ile Met Ala Glu Ala Asn Ala Asp Asn Gly Arg Phe Met Met
435 440 445

Glu Leu Gly Lys Tyr Leu Arg Val Gly Phe Phe Pro Asp Val Lys Thr
450 455 460

Thr Ile Asn Leu Ser Gly Pro Glu Ala Tyr Ala Ala Ala Tyr Gln Asp
465 470 475 480

Leu Leu Met Phe Lys Glu Gly Ser Met Asn Ile His Leu Ile Glu Ala
485 490 495

Asp Leu Arg Asn Phe Glu Ile Ser Lys Thr Asn Ile Ser Gln Ser Thr
500 505 510

Glu Gln Glu Met Ala Ser Leu Trp Ser Phe Asp Asp Ala Arg Ala Lys
515 520 525

Ala Gln Phe Glu Glu Tyr Lys Arg Asn Tyr Phe Glu Gly Ser Leu Gly
530 535 540

Glu Asp Asp Asn Leu Asp Phe Ser Gln Asn Ile Val Val Asp Lys Glu
545 550 555 560

Tyr Leu Leu Glu Lys Ile Ser Ser Leu Ala Arg Ser Ser Glu Arg Gly
565 570 575

Tyr Ile His Tyr Ile Val Gln Leu Gln Gly Asp Lys Ile Ser Tyr Glu
580 585 590

Ala Ala Cys Asn Leu Phe Ala Lys Thr Pro Tyr Asp Ser Val Leu Phe
595 600 605

Gln Lys Asn Ile Glu Asp Ser Glu Ile Ala Tyr Tyr Tyr Asn Pro Gly
610 615 620

Asp Gly Glu Ile Gln Glu Ile Asp Lys Tyr Lys Ile Pro Ser Ile Ile
625 630 635 640

Ser Asp Arg Pro Lys Ile Lys Leu Thr Phe Ile Gly His Gly Lys Asp
645 650 655

Glu Phe Asn Thr Asp Ile Phe Ala Gly Phe Asp Val Asp Ser Leu Ser
660 665 670

Thr Glu Ile Glu Ala Ala Ile Asp Leu Ala Lys Glu Asp Ile Ser Pro
675 680 685
Lys Ser Ile Glu Ile Asn Leu Leu Gly Cys Asn Met Phe Ser Tyr Ser
690 695 700

Ile Asn Val Glu Glu Thr Tyr Pro Gly Lys Leu Leu Leu Lys Val Lys
705 710 715 720

Asp Lys Ile Ser Glu Leu Met Pro Ser Ile Ser Gln Asp Ser Ile Ile
725 730 735

Val Ser Ala Asn Gln Tyr Glu Val Arg Ile Asn Ser Glu Gly Arg Arg
740 745 750

Glu Leu Leu Asp His Ser Gly Glu Trp Ile Asn Lys Glu Glu Ser Ile
755 760 765

Ile Lys Asp Ile Ser Ser Lys Glu Tyr Ile Ser Phe Asn Pro Lys Glu
770 775 780

Asn Lys Ile Thr Val Lys Ser Lys Asn Leu Pro Glu Leu Ser Thr Leu
785 790 795 800

Leu Gln Glu Ile Arg Asn Asn Ser Asn Ser Ser Asp Ile Glu Leu Glu
805 810 815

Glu Lys Val Met Leu Thr Glu Cys Glu Ile Asn Val Ile Ser Asn Ile
820 825 830

Asp Thr Gln Ile Val Glu Glu Arg Ile Glu Glu Ala Lys Asn Leu Thr
835 840 845

Ser Asp Ser Ile Asn Tyr Ile Lys Asp Glu Phe Lys Leu Ile Glu Ser
850 855 860

Ile Ser Asp Ala Leu Cys Asp Leu Lys Gln Gln Asn Glu Leu Glu Asp
865 870 875 880

Ser His Phe Ile Ser Phe Glu Asp Ile Ser Glu Thr Asp Glu Gly Phe
885 890 895

Ser Ile Arg Phe Ile Asn Lys Glu Thr Gly Glu Ser Ile Phe Val Glu
900 905 910

Thr Glu Lys Thr Ile Phe Ser Glu Tyr Ala Asn His Ile Thr Glu Glu
915 920 925

Ile Ser Lys Ile Lys Gly Thr Ile Phe Asp Thr Val Asn Gly Lys Leu
930 935 940

Val Lys Lys Val Asn Leu Asp Thr Thr His Glu Val Asn Thr Leu Asn
945 950 955 960

Ala Ala Phe Phe Ile Gln Ser Leu Ile Glu Tyr Asn Ser Ser Lys Glu
965 970 975

Ser Leu Ser Asn Leu Ser Val Ala Met Lys Val Gln Val Tyr Ala Gln
980 985 990

Leu Phe Ser Thr Gly Leu Asn Thr Ile Thr Asp Ala Ala Lys Val Val
995 1000 1005

Glu Leu Val Ser Thr Ala Leu Asp Glu Thr Ile Asp Leu Leu Pro Thr
1010 1015 1020

Leu Ser Glu Gly Leu Pro Ile Ile Ala Thr Ile Ile Asp Gly Val Ser
1025 1030 1035 1040

Leu Gly Ala Ala Ile Lys Glu Leu Ser Glu Thr Ser Asp Pro Leu Leu
1045 1050 1055






Arg Gln Glu Ile Glu Ala Lys Ile Gly Ile Met Ala Val Asn Leu Thr
1060 1065 1070

Thr Ala Thr Thr Ala Ile Ile Thr Ser Ser Leu Gly Ile Ala Ser Gly
1075 1080 1085

Phe Ser Ile Leu Leu Val Pro Leu Ala Gly Ile Ser Ala Gly Ile Pro
1090 1095 1100

Ser Leu Val Asn Asn Glu Leu Val Leu Arg Asp Lys Ala Thr Lys Val
1105 1110 1115 1120

Val Asp Tyr Phe Lys His Val Ser Leu Val Glu Thr Glu Gly Val Phe
1125 1130 1135

Thr Leu Leu Asp Asp Lys Ile Met Met Pro Gln Asp Asp Leu Val Ile
1140 1145 1150

Ser Glu Ile Asp Phe Asn Asn Asn Ser Ile Val Leu Gly Lys Cys Glu
1155 1160 1165

Ile Trp Arg Met Glu Gly Gly Ser Gly His Thr Val Thr Asp Asp Ile
1170 1175 1180

Asp His Phe Phe Ser Ala Pro Ser Ile Thr Tyr Arg Glu Pro His Leu
1185 1190 1195 1200

Ser Ile Tyr Asp Val Leu Glu Val Gln Lys Glu Glu Leu Asp Leu Ser
1205 1210 1215

Lys Asp Leu Met Val Leu Pro Asn Ala Pro Asn Arg Val Phe Ala Trp
1220 1225 1230

Glu Thr Gly Trp Thr Pro Gly Leu Arg Ser Leu Glu Asn Asp Gly Thr
1235 1240 1245

Lys Leu Leu Asp Arg Ile Arg Asp Asn Tyr Glu Gly Glu Phe Tyr Trp
1250 1255 1260

Arg Tyr Phe Ala Phe Ile Ala Asp Ala Leu Ile Thr Thr Leu Lys Pro
1265 1270 1275 1280

Arg Tyr Glu Asp Thr Asn Ile Arg Ile Asn Leu Asp Ser Asn Thr Arg
1285 1290 1295

Ser Phe Ile Val Pro Ile Ile Thr Thr Glu Tyr Ile Arg Glu Lys Leu
1300 1305 1310

Ser Tyr Ser Phe Tyr Gly Ser Gly Gly Thr Tyr Ala Leu Ser Leu Ser
1315 1320 1325

Gln Tyr Asn Met Gly Ile Asn Ile Glu Leu Ser Glu Ser Asp Val Trp
1330 1335 1340

Ile Ile Asp Val Asp Asn Val Val Arg Asp Val Thr Ile Glu Ser Asp
1345 1350 1355 1360

Lys Ile Lys Lys Gly Asp Leu Ile Glu Gly Ile Leu Ser Thr Leu Ser
1365 1370 1375

Ile Glu Glu Asn Lys Ile Ile Leu Asn Ser His Glu Ile Asn Phe Ser
1380 1385 1390

Gly Glu Val Asn Gly Ser Asn Gly Phe Val Ser Leu Thr Phe Ser Ile
1395 1400 1405

Leu Glu Gly Ile Asn Ala Ile Ile Glu Val Asp Leu Leu Ser Lys Ser
1410 1415 1420
Tyr Lys Leu Leu Ile Ser Gly Glu Leu Lys Ile Leu Met Leu Asn Ser
1425 1430 1435 1440

Asn His Ile Gln Gln Lys Ile Asp Tyr Ile Gly Phe Asn Ser Glu Leu
1445 1450 1455

Gln Lys Asn Ile Pro Tyr Ser Phe Val Asp Ser Glu Gly Lys Glu Asn
1460 1465 1470

Gly Phe Ile Asn Gly Ser Thr Lys Glu Gly Leu Phe Val Ser Glu Leu
1475 1480 1485

Pro Asp Val Val Leu Ile Ser Lys Val Tyr Met Asp Asp Ser Lys Pro
1490 1495 1500

Ser Phe Gly Tyr Tyr Ser Asn Asn Leu Lys Asp Val Lys Val Ile Thr
1505 1510 1515 1520

Lys Asp Asn Val Asn Ile Leu Thr Gly Tyr Tyr Leu Lys Asp Asp Ile
1525 1530 1535

Lys Ile Ser Leu Ser Leu Thr Leu Gln Asp Glu Lys Thr Ile Lys Leu
1540 1545 1550

Asn Ser Val His Leu Asp Glu Ser Gly Val Ala Glu Ile Leu Lys Phe
1555 1560 1565

Met Asn Arg Lys Gly Asn Thr Asn Thr Ser Asp Ser Leu Met Ser Phe
1570 1575 1580

Leu Glu Ser Met Asn Ile Lys Ser Ile Phe Val Asn Phe Leu Gln Ser
1585 1590 1595 1600



Asn Ile Lys Phe Ile Leu Asp Ala Asn Phe Ile Ile Ser Gly Thr Thr
1605 1610 1615

Ser Ile Gly Gln Phe Glu Phe Ile Cys Asp Glu Asn Asp Asn Ile Gln
1620 1625 1630

Pro Tyr Phe Ile Lys Phe Asn Thr Leu Glu Thr Asn Tyr Thr Leu Tyr
1635 1640 1645

Val Gly Asn Arg Gln Asn Met Ile Val Glu Pro Asn Tyr Asp Leu Asp
1650 1655 1660

Asp Ser Gly Asp Ile Ser Ser Thr Val Ile Asn Phe Ser Gln Lys Tyr
1665 1670 1675 1680

Leu Tyr Gly Ile Asp Ser Cys Val Asn Lys Val Val Ile Ser Pro Asn
1685 1690 1695

Ile Tyr Thr Asp Glu Ile Asn Ile Thr Pro Val Tyr Glu Thr Asn Asn
1700 1705 1710

Thr Tyr Pro Glu Val Ile Val Leu Asp Ala Asn Tyr Ile Asn Glu Lys
1715 1720 1725

Ile Asn Val Asn Ile Asn Asp Leu Ser Ile Arg Tyr Val Trp Ser Asn
1730 1735 1740

Asp Gly Asn Asp Phe Ile Leu Met Ser Thr Ser Glu Glu Asn Lys Val
1745 1750 1755 1760

Ser Gln Val Lys Ile Arg Phe Val Asn Val Phe Lys Asp Lys Thr Leu
1765 1770 1775

Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp Lys Gln Asp Val Pro Val
1780 1785 1790
Ser Glu Ile Ile Leu Ser Phe Thr Pro Ser Tyr Tyr Glu Asp Gly Leu
1795 1800 1805

Ile Gly Tyr Asp Leu Gly Leu Val Ser Leu Tyr Asn Glu Lys Phe Tyr
1810 1815 1820

Ile Asn Asn Phe Gly Met Met Val Ser Gly Leu Ile Tyr Ile Asn Asp
1825 1830 1835 1840

Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn Asn Leu Ile Thr Gly Phe
1845 1850 1855

Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn Pro Ile Asn Gly Gly
1860 1865 1870

Ala Ala Ser Ile Gly Glu Thr Ile Ile Asp Asp Lys Asn Tyr Tyr Phe
1875 1880 1885

Asn Gln Ser Gly Val Leu Gln Thr Gly Val Phe Ser Thr Glu Asp Gly
1890 1895 1900

Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp Glu Asn Leu Glu Gly
1905 1910 1915 1920

Glu Ala Ile Asp Phe Thr Gly Lys Leu Ile Ile Asp Glu Asn Ile Tyr
1925 1930 1935

Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu Trp Lys Glu Leu Asp
1940 1945 1950

Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly Lys Ala Phe Lys Gly
1955 1960 1965

Leu Asn Gln Ile Gly Asp Tyr Lys Tyr Tyr Phe Asn Ser Asp Gly Val
1970 1975 1980

Met Gln Lys Gly Phe Val Ser Ile Asn Asp Asn Lys His Tyr Phe Asp
1985 1990 1995 2000

Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu Ile Asp Gly Lys His
2005 2010 2015

Phe Tyr Phe Ala Glu Asn Gly Glu Met Gln Ile Gly Val Phe Asn Thr
2020 2025 2030

Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn Glu Asp Leu Gly Asn
2035 2040 2045

Glu Glu Gly Glu Glu Ile Ser Tyr Ser Gly Ile Leu Asn Phe Asn Asn
2050 2055 2060

Lys Ile Tyr Tyr Phe Asp Asp Ser Phe Thr Ala Val Val Gly Trp Lys
2065 2070 2075 2080

Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp Glu Asp Thr Ala Glu
2085 2090 2095

Ala Tyr Ile Gly Leu Ser Leu Ile Asn Asp Gly Gln Tyr Tyr Phe Asn
2100 2105 2110

Asp Asp Gly Ile Met Gln Val Gly Phe Val Thr Ile Asn Asp Lys Val
2115 2120 2125

Phe Tyr Phe Ser Asp Ser Gly Ile Ile Glu Ser Gly Val Gln Asn Ile
2130 2135 2140

Asp Asp Asn Tyr Phe Tyr Ile Asp Asp Asn Gly Ile Val Gln Ile Gly
2145 2150 2155 2160

Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe Ala Pro Ala Asn Thr
2165 2170 2175

Val Asn Asp Asn Ile Tyr Gly Gln Ala Val Glu Tyr Ser Gly Leu Val
2180 2185 2190

Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu Thr Tyr Thr Ile Glu
2195 2200 2205

Thr Gly Trp Ile Tyr Asp Met Glu Asn Glu Ser Asp Lys Tyr Tyr Phe
2210 2215 2220

Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly Ile Asn Leu Ile Asp Asp
2225 2230 2235 2240

Ile Lys Tyr Tyr Phe Asp Glu Lys Gly Ile Met Arg Thr Gly Leu Ile
2245 2250 2255

Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu Asn Gly Glu Met Gln
2260 2265 2270

Phe Gly Tyr Ile Asn Ile Glu Asp Lys Met Phe Tyr Phe Gly Glu Asp
2275 2280 2285

Gly Val Met Gln Ile Gly Val Phe Asn Thr Pro Asp Gly Phe Lys Tyr
2290 2295 2300

Phe Ala His Gln Asn Thr Leu Asp Glu Asn Phe Glu Gly Glu Ser Ile
2305 2310 2315 2320

Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys Arg Tyr Tyr Phe Thr
2325 2330 2335

Asp Glu Tyr Ile Ala Ala Thr Gly Ser Val Ile Ile Asp Gly Glu Glu
2340 2345 2350

Tyr Tyr Phe Asp Pro Asp Thr Ala Gln Leu Val Ile Ser Glu
2355 2360 2365

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

TAGAAAAAAT GGCAAATGT

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTCATCTTG TAGAGTCAAA G

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GATGCCACAA GATGATTTAG TG

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CTAATTGAGC TGTATCAGGA TC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CGGAATTCCT AGAAAAAATG GCAAATG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GCTCTAGAAT GACCATAAGC TAGCCA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAATTCGA GTTGGTAGAA AGGTGGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

CGGAATTCGG TTATTATCTT AAGGATG 27

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CGGAATTCTT GATAACTGGA TTTGTGAC 28

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 511 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Leu Ile Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe Asn
1 5 10 15

Pro Ile Asn Gly Gly Ala Ala Ser Ile Gly Glu Thr Ile Ile Asp Asp
20 25 30

Lys Asn Tyr Tyr Phe Asn Gln Ser Gly Val Leu Gln Thr Gly Val Phe
35 40 45

Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu Asp
50 55 60

Glu Asn Leu Glu Gly Glu Ala Ile Asp Phe Thr Gly Lys Leu Ile Ile
65 70 75 80

Asp Glu Asn Ile Tyr Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val Glu
85 90 95

Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr Gly
100 105 110

Lys Ala Phe Lys Gly Leu Asn Gln Ile Gly Asp Tyr Lys Tyr Tyr Phe
115 120 125

Asn Ser Asp Gly Val Met Gln Lys Gly Phe Val Ser Ile Asn Asp Asn
130 135 140

Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr Glu
145 150 155 160

Ile Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gln Ile
165 170 175

Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His His Asn
180 185 190

Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu Ile Ser Tyr Ser Gly Ile
195 200 205

Leu Asn Phe Asn Asn Lys Ile Tyr Tyr Phe Asp Asp Ser Phe Thr Ala
210 215 220

Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe Asp
225 230 235 240



Glu Asp Thr Ala Glu Ala Tyr Ile Gly Leu Ser Leu Ile Asn Asp Gly
245 250 255

Gln Tyr Tyr Phe Asn Asp Asp Gly Ile Met Gln Val Gly Phe Val Thr
260 265 270

Ile Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly Ile Ile Glu Ser
275 280 285

Gly Val Gln Asn Ile Asp Asp Asn Tyr Phe Tyr Ile Asp Asp Asn Gly
290 295 300

Ile Val Gln Ile Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr Phe
305 310 315 320

Ala Pro Ala Asn Thr Val Asn Asp Asn Ile Tyr Gly Gln Ala Val Glu
325 330 335

Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly Glu
340 345 350

Thr Tyr Thr Ile Glu Thr Gly Trp Ile Tyr Asp Met Glu Asn Glu Ser
355 360 365

Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly Ile
370 375 380

Asn Leu Ile Asp Asp Ile Lys Tyr Tyr Phe Asp Glu Lys Gly Ile Met
385 390 395 400

Arg Thr Gly Leu Ile Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn Glu
405 410 415

Asn Gly Glu Met Gln Phe Gly Tyr Ile Asn Ile Glu Asp Lys Met Phe
420 425 430

Tyr Phe Gly Glu Asp Gly Val Met Gln Ile Gly Val Phe Asn Thr Pro
435 440 445

Asp Gly Phe Lys Tyr Phe Ala His Gln Asn Thr Leu Asp Glu Asn Phe
450 455 460

Glu Gly Glu Ser Ile Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu Lys
465 470 475 480

Arg Tyr Tyr Phe Thr Asp Glu Tyr Ile Ala Ala Thr Gly Ser Val Ile
485 490 495

Ile Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gln Leu
500 505 510
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 608 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Ser Glu Glu Asn Lys Val Ser Gln Val Lys Ile Arg Phe Val Asn Val
1 5 10 15

Phe Lys Asp Lys Thr Leu Ala Asn Lys Leu Ser Phe Asn Phe Ser Asp
20 25 30

Lys Gln Asp Val Pro Val Ser Glu Ile Ile Leu Ser Phe Thr Pro Ser
35 40 45

Tyr Tyr Glu Asp Gly Leu Ile Gly Tyr Asp Leu Gly Leu Val Ser Leu
50 55 60

Tyr Asn Glu Lys Phe Tyr Ile Asn Asn Phe Gly Met Met Val Ser Gly
65 70 75 80

Leu Ile Tyr Ile Asn Asp Ser Leu Tyr Tyr Phe Lys Pro Pro Val Asn
85 90 95

Asn Leu Ile Thr Gly Phe Val Thr Val Gly Asp Asp Lys Tyr Tyr Phe
100 105 110

Asn Pro Ile Asn Gly Gly Ala Ala Ser Ile Gly Glu Thr Ile Ile Asp
115 120 125

Asp Lys Asn Tyr Tyr Phe Asn Gln Ser Gly Val Leu Gln Thr Gly Val
130 135 140

Phe Ser Thr Glu Asp Gly Phe Lys Tyr Phe Ala Pro Ala Asn Thr Leu
145 150 155 160

Asp Glu Asn Leu Glu Gly Glu Ala Ile Asp Phe Thr Gly Lys Leu Ile
165 170 175

Ile Asp Glu Asn Ile Tyr Tyr Phe Asp Asp Asn Tyr Arg Gly Ala Val
180 185 190

Glu Trp Lys Glu Leu Asp Gly Glu Met His Tyr Phe Ser Pro Glu Thr
195 200 205

Gly Lys Ala Phe Lys Gly Leu Asn Gln Ile Gly Asp Tyr Lys Tyr Tyr
210 215 220

Phe Asn Ser Asp Gly Val Met Gln Lys Gly Phe Val Ser Ile Asn Asp
225 230 235 240

Asn Lys His Tyr Phe Asp Asp Ser Gly Val Met Lys Val Gly Tyr Thr
245 250 255

Glu Ile Asp Gly Lys His Phe Tyr Phe Ala Glu Asn Gly Glu Met Gln
260 265 270

Ile Gly Val Phe Asn Thr Glu Asp Gly Phe Lys Tyr Phe Ala His His
275 280 285

Asn Glu Asp Leu Gly Asn Glu Glu Gly Glu Glu Ile Ser Tyr Ser Gly
290 295 300

Ile Leu Asn Phe Asn Asn Lys Ile Tyr Tyr Phe Asp Asp Ser Phe Thr
305 310 315 320

Ala Val Val Gly Trp Lys Asp Leu Glu Asp Gly Ser Lys Tyr Tyr Phe
325 330 335

Asp Glu Asp Thr Ala Glu Ala Tyr Ile Gly Leu Ser Leu Ile Asn Asp
340 345 350

Gly Gln Tyr Tyr Phe Asn Asp Asp Gly Ile Met Gln Val Gly Phe Val
355 360 365

Thr Ile Asn Asp Lys Val Phe Tyr Phe Ser Asp Ser Gly Ile Ile Glu
370 375 380

Ser Gly Val Gln Asn Ile Asp Asp Asn Tyr Phe Tyr Ile Asp Asp Asn
385 390 395 400

Gly Ile Val Gln Ile Gly Val Phe Asp Thr Ser Asp Gly Tyr Lys Tyr
405 410 415

Phe Ala Pro Ala Asn Thr Val Asn Asp Asn Ile Tyr Gly Gln Ala Val
420 425 430

Glu Tyr Ser Gly Leu Val Arg Val Gly Glu Asp Val Tyr Tyr Phe Gly
435 440 445

Glu Thr Tyr Thr Ile Glu Thr Gly Trp Ile Tyr Asp Met Glu Asn Glu
450 455 460

Ser Asp Lys Tyr Tyr Phe Asn Pro Glu Thr Lys Lys Ala Cys Lys Gly
465 470 475 480

Ile Asn Leu Ile Asp Asp Ile Lys Tyr Tyr Phe Asp Glu Lys Gly Ile
485 490 495

Met Arg Thr Gly Leu Ile Ser Phe Glu Asn Asn Asn Tyr Tyr Phe Asn
500 505 510

Glu Asn Gly Glu Met Gln Phe Gly Tyr Ile Asn Ile Glu Asp Lys Met
515 520 525

Phe Tyr Phe Gly Glu Asp Gly Val Met Gln Ile Gly Val Phe Asn Thr
530 535 540

Pro Asp Gly Phe Lys Tyr Phe Ala His Gln Asn Thr Leu Asp Glu Asn
545 550 555 560

Phe Glu Gly Glu Ser Ile Asn Tyr Thr Gly Trp Leu Asp Leu Asp Glu
565 570 575

Lys Arg Tyr Tyr Phe Thr Asp Glu Tyr Ile Ala Ala Thr Gly Ser Val
580 585 590

Ile Ile Asp Gly Glu Glu Tyr Tyr Phe Asp Pro Asp Thr Ala Gln Leu
595 600 605

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1314

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

ATG GCT CGT CTG CTG TCT ACC TTC ACT GAA TAC ATC AAG AAC ATC ATC 48
Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr Ile Lys Asn Ile Ile
1 5 10 15

AAT ACC TCC ATC CTG AAC CTG CGC TAC GAA TCC AAT CAC CTG ATC GAC 96
Asn Thr Ser Ile Leu Asn Leu Arg Tyr Glu Ser Asn His Leu Ile Asp
20 25 30

CTG TCT CGC TAC GCT TCC JKAA ATC AAC ATC GGT TCT AAA GTT AAC TTC 14 4 

Leu Ser Arg Tyr Ala Ser Lys He Asn He Gly Ser Lys Val Asn Phe 
35 40 45 

15 GAT CCG ATC GAC AAG AAT CAG ATC CAG CTG TTC AAT CTG GAA TCT TCC 192 

Asp Pro He Asp Lys Asn Gin He Gin Leu Phe Asn Leu Glu Ser Ser 
50 55 60 






AAA ATC GAA GTT ATC CTG AAG AAT GCT ATC GTA TAC AAC TCT ATG TAC 240 
Lys Ile Glu Val Ile Leu Lys Asn Ala Ile Val Tyr Asn Ser Met Tyr 
65 70 75 80 

GAA AAC TTC TCC ACC TCC TTC TGG ATC CGT ATC CCG AAA TAC TTC AAC 288 
Glu Asn Phe Ser Thr Ser Phe Trp Ile Arg Ile Pro Lys Tyr Phe Asn 
85 90 95 

TCC ATC TCT CTG AAC AAT GAA TAC ACC ATC ATC AAC TGC ATG GAA AAC 336 
Ser Ile Ser Leu Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Glu Asn 
100 105 110 

AAT TCT GGT TGG AAA GTA TCT CTG AAC TAC GGT GAA ATC ATC TGG ACT 384 
Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu Ile Ile Trp Thr 
115 120 125 

CTG CAG GAC ACT CAG GAA ATC AAA CAG CGT GTT GTA TTC AAA TAC TCT 432 
Leu Gln Asp Thr Gln Glu Ile Lys Gln Arg Val Val Phe Lys Tyr Ser 
130 135 140 



CAG ATG ATC AAC ATC TCT GAC TAC ATC AAT CGC TGG ATC TTC GTT ACC 480 
Gln Met Ile Asn Ile Ser Asp Tyr Ile Asn Arg Trp Ile Phe Val Thr 
145 150 155 160 

ATC ACC AAC AAT CGT CTG AAT AAC TCC AAA ATC TAC ATC AAC GGC CGT 528 
Ile Thr Asn Asn Arg Leu Asn Asn Ser Lys Ile Tyr Ile Asn Gly Arg 
165 170 175 

CTG ATC GAC CAG AAA CCG ATC TCC AAT CTG GGT AAC ATC CAC GCT TCT 576 
Leu Ile Asp Gln Lys Pro Ile Ser Asn Leu Gly Asn Ile His Ala Ser 
180 185 190 

AAT AAC ATC ATG TTC AAA CTG GAC GGT TGT CGT GAC ACT CAC CGC TAC 624 
Asn Asn Ile Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 
195 200 205 

ATC TGG ATC AAA TAC TTC AAT CTG TTC GAC AAA GAA CTG AAC GAA AAA 672 
Ile Trp Ile Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 
210 215 220 

GAA ATC AAA GAC CTG TAC GAC AAC CAG TCC AAT TCT GGT ATC CTG AAA 720 
Glu Ile Lys Asp Leu Tyr Asp Asn Gln Ser Asn Ser Gly Ile Leu Lys 
225 230 235 240 

GAC TTC TGG GGT GAC TAC CTG CAG TAC GAC AAA CCG TAC TAC ATG CTG 768 
Asp Phe Trp Gly Asp Tyr Leu Gln Tyr Asp Lys Pro Tyr Tyr Met Leu 
245 250 255 

AAT CTG TAC GAT CCG AAC AAA TAC GTT GAC GTC AAC AAT GTA GGT ATC 816 
Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly Ile 
260 265 270 



CGC GGT TAC ATG TAC CTG AAA GGT CCG CGT GGT TCT GTT ATG ACT ACC 864 
Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 285 

AAC ATC TAC CTG AAC TCT TCC CTG TAC CGT GGT ACC AAA TTC ATC ATC 912 
Asn Ile Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe Ile Ile 
290 295 300 

AAG AAA TAC GCG TCT GGT AAC AAG GAC AAT ATC GTT CGC AAC AAT GAT 960 
Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn Ile Val Arg Asn Asn Asp 
305 310 315 320 

CGT GTA TAC ATC AAT GTT GTA GTT AAG AAC AAA GAA TAC CGT CTG GCT 1008 
Arg Val Tyr Ile Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
325 330 335 

ACC AAT GCT TCT CAG GCT GGT GTA GAA AAG ATC TTG TCT GCT CTG GAA 1056 
Thr Asn Ala Ser Gln Ala Gly Val Glu Lys Ile Leu Ser Ala Leu Glu 
340 345 350 

ATC CCG GAC GTT GGT AAT CTG TCT CAG GTA GTT GTA ATG AAA TCC AAG 1104 
Ile Pro Asp Val Gly Asn Leu Ser Gln Val Val Val Met Lys Ser Lys 
355 360 365 

AAC GAC CAG GGT ATC ACT AAC AAA TGC AAA ATG AAT CTG CAG GAC AAC 1152 
Asn Asp Gln Gly Ile Thr Asn Lys Cys Lys Met Asn Leu Gln Asp Asn 
370 375 380 

AAT GGT AAC GAT ATC GGT TTC ATC GGT TTC CAC CAG TTC AAC AAT ATC 1200 
Asn Gly Asn Asp Ile Gly Phe Ile Gly Phe His Gln Phe Asn Asn Ile 
385 390 395 400 

GCT AAA CTG GTT GCT TCC AAC TGG TAC AAT CGT CAG ATC GAA CGT TCC 1248 
Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gln Ile Glu Arg Ser 
405 410 415 

TCT CGC ACT CTG GGT TGC TCT TGG GAG TTC ATC CCG GTT GAT GAC GGT 1296 
Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe Ile Pro Val Asp Asp Gly 
420 425 430 

TGG GGT GAA CGT CCG CTG TAACCCGGGA AAGCTT 1330 
Trp Gly Glu Arg Pro Leu 
435 
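
The figures stated for SEQ ID NO: 22 can be cross-checked arithmetically: a coding region at positions 1..1314 spans 1314 / 3 = 438 codons, and the 16 bases printed after the last codon (TAACCCGGGA AAGCTT) bring the total to the stated 1330 base pairs. The following Python sketch illustrates that check. It is not part of the original listing, and the function name `check_cds` and its parameters are hypothetical.

```python
def check_cds(cds_start, cds_end, total_bp, trailing_bases):
    """Return the number of codons in a CDS after two consistency checks.

    All names here are illustrative assumptions, not taken from the listing.
    """
    cds_len = cds_end - cds_start + 1
    # A coding region must consist of whole codons.
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # The bases printed after the CDS must account for the remaining length.
    if cds_end + len(trailing_bases) != total_bp:
        raise ValueError("CDS plus trailing bases does not equal total length")
    return cds_len // 3


# Values read from SEQ ID NO: 22 (LENGTH 1330 base pairs, CDS at 1..1314).
trailing = "TAACCCGGGAAAGCTT"  # bases printed after the last codon
codons = check_cds(1, 1314, 1330, trailing)
print(codons)
```

The same function can be applied to any other nucleic acid entry by substituting the LENGTH and LOCATION values given in its header.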

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Arg Leu Leu Ser Thr Phe Thr Glu Tyr Ile Lys Asn Ile Ile 
1 5 10 15 

Asn Thr Ser Ile Leu Asn Leu Arg Tyr Glu Ser Asn His Leu Ile Asp 
20 25 30 

Leu Ser Arg Tyr Ala Ser Lys Ile Asn Ile Gly Ser Lys Val Asn Phe 
35 40 45 

Asp Pro Ile Asp Lys Asn Gln Ile Gln Leu Phe Asn Leu Glu Ser Ser 
50 55 60 

Lys Ile Glu Val Ile Leu Lys Asn Ala Ile Val Tyr Asn Ser Met Tyr 
65 70 75 80 

Glu Asn Phe Ser Thr Ser Phe Trp Ile Arg Ile Pro Lys Tyr Phe Asn 
85 90 95 

Ser Ile Ser Leu Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Glu Asn 
100 105 110 

Asn Ser Gly Trp Lys Val Ser Leu Asn Tyr Gly Glu Ile Ile Trp Thr 
115 120 125 

Leu Gln Asp Thr Gln Glu Ile Lys Gln Arg Val Val Phe Lys Tyr Ser 
130 135 140 

Gln Met Ile Asn Ile Ser Asp Tyr Ile Asn Arg Trp Ile Phe Val Thr 
145 150 155 160 

Ile Thr Asn Asn Arg Leu Asn Asn Ser Lys Ile Tyr Ile Asn Gly Arg 
165 170 175 

Leu Ile Asp Gln Lys Pro Ile Ser Asn Leu Gly Asn Ile His Ala Ser 
180 185 190 

Asn Asn Ile Met Phe Lys Leu Asp Gly Cys Arg Asp Thr His Arg Tyr 
195 200 205 

Ile Trp Ile Lys Tyr Phe Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys 
210 215 220 

Glu Ile Lys Asp Leu Tyr Asp Asn Gln Ser Asn Ser Gly Ile Leu Lys 
225 230 235 240 

Asp Phe Trp Gly Asp Tyr Leu Gln Tyr Asp Lys Pro Tyr Tyr Met Leu 
245 250 255 

Asn Leu Tyr Asp Pro Asn Lys Tyr Val Asp Val Asn Asn Val Gly Ile 
260 265 270 

Arg Gly Tyr Met Tyr Leu Lys Gly Pro Arg Gly Ser Val Met Thr Thr 
275 280 285 

Asn Ile Tyr Leu Asn Ser Ser Leu Tyr Arg Gly Thr Lys Phe Ile Ile 
290 295 300 

Lys Lys Tyr Ala Ser Gly Asn Lys Asp Asn Ile Val Arg Asn Asn Asp 
305 310 315 320 

Arg Val Tyr Ile Asn Val Val Val Lys Asn Lys Glu Tyr Arg Leu Ala 
325 330 335 

Thr Asn Ala Ser Gln Ala Gly Val Glu Lys Ile Leu Ser Ala Leu Glu 
340 345 350 

Ile Pro Asp Val Gly Asn Leu Ser Gln Val Val Val Met Lys Ser Lys 
355 360 365 

Asn Asp Gln Gly Ile Thr Asn Lys Cys Lys Met Asn Leu Gln Asp Asn 
370 375 380 

Asn Gly Asn Asp Ile Gly Phe Ile Gly Phe His Gln Phe Asn Asn Ile 
385 390 395 400 

Ala Lys Leu Val Ala Ser Asn Trp Tyr Asn Arg Gln Ile Glu Arg Ser 
405 410 415 

Ser Arg Thr Leu Gly Cys Ser Trp Glu Phe Ile Pro Val Asp Asp Gly 
420 425 430 

Trp Gly Glu Arg Pro Leu 
435 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 10 15 

Ile Glu Gly Arg His Met Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG GGC CAT CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT 48 
Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 10 15 

ATC GAA GGT CGT CAT ATG GCT AGC ATG GCT CGT CTG CTG TCT ACC TTC 96 
Ile Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

ACT GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC 144 
Thr Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn Leu Arg 
35 40 45 

TAC GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC 192 
Tyr Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser Lys Ile 
50 55 60 

AAC ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC 240 
Asn Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn Gln Ile 
65 70 75 80 

CAG CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT 288 
Gln Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu Lys Asn 
85 90 95 

GCT ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG 336 
Ala Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp 
100 105 110 

ATC CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC 384 
Ile Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn Glu Tyr 
115 120 125 

ACC ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG 432 
Thr Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 140 

AAC TAC GGT GAA ATC ATC TGG ACT CTG CAG GAC ACT CAG GAA ATC AAA 480 
Asn Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu Ile Lys 
145 150 155 160 

CAG CGT GTT GTA TTC AAA TAC TCT CAG ATG ATC AAC ATC TCT GAC TAC 528 
Gln Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser Asp Tyr 
165 170 175 

ATC AAT CGC TGG ATC TTC GTT ACC ATC ACC AAC AAT CGT CTG AAT AAC 576 
Ile Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu Asn Asn 
180 185 190 

TCC AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC 624 
Ser Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro Ile Ser 
195 200 205 

AAT CTG GGT AAC ATC CAC GCT TCT AAT AAC ATC ATG TTC AAA CTG GAC 672 
Asn Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys Leu Asp 
210 215 220 

GGT TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG 720 
Gly Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe Asn Leu 
225 230 235 240 

TTC GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC 768 
Phe Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr Asp Asn 
245 250 255 

CAG TCC AAT TCT GGT ATC CTG AAA GAC TTC TGG GGT GAC TAC CTG CAG 816 
Gln Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gln 
260 265 270 

TAC GAC AAA CCG TAC TAC ATG CTG AAT CTG TAC GAT CCG AAC AAA TAC 864 
Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr 
275 280 285 

GTT GAC GTC AAC AAT GTA GGT ATC CGC GGT TAC ATG TAC CTG AAA GGT 912 
Val Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu Lys Gly 
290 295 300 

CCG CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG 960 
Pro Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser Ser Leu 
305 310 315 320 

TAC CGT GGT ACC AAA TTC ATC ATC AAG AAA TAC GCG TCT GGT AAC AAG 1008 
Tyr Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly Asn Lys 
325 330 335 

GAC AAT ATC GTT CGC AAC AAT GAT CGT GTA TAC ATC AAT GTT GTA GTT 1056 
Asp Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val Val Val 
340 345 350 

AAG AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA 1104 
Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala Gly Val 
355 360 365 

GAA AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT 1152 
Glu Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn Leu Ser 
370 375 380 

CAG GTA GTT GTA ATG AAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA 1200 
Gln Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr Asn Lys 
385 390 395 400 

TGC AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC 1248 
Cys Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly Phe Ile 
405 410 415 

GGT TTC CAC CAG TTC AAC AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG 1296 
Gly Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser Asn Trp 
420 425 430 

TAC AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG 1344 
Tyr Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp 
435 440 445 

GAG TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG 1386 
Glu Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
450 455 460 

TAACCCGGGA AAGCTT 1402 
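
Each codon row in SEQ ID NO: 25 is paired with the residues it encodes, and that pairing can be reproduced mechanically. The following Python sketch assumes the standard genetic code, which is consistent with the codon and residue pairs printed in the listing. The names `translate`, `CODON_TABLE` and `THREE_LETTER` are hypothetical and are introduced only for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons are ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna):
    """Translate spaced or contiguous DNA into a list of three-letter codes."""
    seq = "".join(dna.split()).upper()
    # Ignore any incomplete codon at the end of the input.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return [THREE_LETTER[CODON_TABLE[c]] for c in codons]


# First codon row of SEQ ID NO: 25 (the histidine-tag leader).
first_row = "ATG GGC CAT CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT"
print(" ".join(translate(first_row)))
```

Comparing the output for a row against the residue line printed beneath it is one way to spot a misread codon or residue.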

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Gly His His His His His His His His His His Ser Ser Gly His 
1 5 10 15 

Ile Glu Gly Arg His Met Ala Ser Met Ala Arg Leu Leu Ser Thr Phe 
20 25 30 

Thr Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn Leu Arg 
35 40 45 

Tyr Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser Lys Ile 
50 55 60 

Asn Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn Gln Ile 
65 70 75 80 

Gln Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu Lys Asn 
85 90 95 



Ala Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp 
100 105 110 

Ile Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn Glu Tyr 
115 120 125 

Thr Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu 
130 135 140 

Asn Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu Ile Lys 
145 150 155 160 

Gln Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser Asp Tyr 
165 170 175 

Ile Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu Asn Asn 
180 185 190 

Ser Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro Ile Ser 
195 200 205 

Asn Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys Leu Asp 
210 215 220 

Gly Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe Asn Leu 
225 230 235 240 

Phe Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr Asp Asn 
245 250 255 

Gln Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gln 
260 265 270 

Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr 
275 280 285 

Val Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu Lys Gly 
290 295 300 

Pro Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser Ser Leu 
305 310 315 320 

Tyr Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly Asn Lys 
325 330 335 

Asp Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val Val Val 
340 345 350 

Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala Gly Val 
355 360 365 

Glu Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn Leu Ser 
370 375 380 

Gln Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr Asn Lys 
385 390 395 400 

Cys Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly Phe Ile 
405 410 415 

Gly Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser Asn Trp 
420 425 430 

Tyr Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp 
435 440 445 

Glu Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3888 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG CAA TTT GTT AAT AAA CAA TTT AAT TAT AAA GAT CCT GTA AAT GGT 48 
Met Gln Phe Val Asn Lys Gln Phe Asn Tyr Lys Asp Pro Val Asn Gly 
1 5 10 15 

GTT GAT ATT GCT TAT ATA AAA ATT CCA AAT GTA GGA CAA ATG CAA CCA 96 
Val Asp Ile Ala Tyr Ile Lys Ile Pro Asn Val Gly Gln Met Gln Pro 
20 25 30 

GTA AAA GCT TTT AAA ATT CAT AAT AAA ATA TGG GTT ATT CCA GAA AGA 144 
Val Lys Ala Phe Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg 
35 40 45 

GAT ACA TTT ACA AAT CCT GAA GAA GGA GAT TTA AAT CCA CCA CCA GAA 192 
Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu 
50 55 60 

GCA AAA CAA GTT CCA GTT TCA TAT TAT GAT TCA ACA TAT TTA AGT ACA 240 
Ala Lys Gln Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 
65 70 75 80 

GAT AAT GAA AAA GAT AAT TAT TTA AAG GGA GTT ACA AAA TTA TTT GAG 288 
Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
85 90 95 

AGA ATT TAT TCA ACT GAT CTT GGA AGA ATG TTG TTA ACA TCA ATA GTA 336 
Arg Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ile Val 
100 105 110 

AGG GGA ATA CCA TTT TGG GGT GGA AGT ACA ATA GAT ACA GAA TTA AAA 384 
Arg Gly Ile Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys 
115 120 125 

GTT ATT GAT ACT AAT TGT ATT AAT GTG ATA CAA CCA GAT GGT AGT TAT 432 
Val Ile Asp Thr Asn Cys Ile Asn Val Ile Gln Pro Asp Gly Ser Tyr 
130 135 140 

AGA TCA GAA GAA CTT AAT CTA GTA ATA ATA GGA CCC TCA GCT GAT ATT 480 
Arg Ser Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile 
145 150 155 160 

ATA CAG TTT GAA TGT AAA AGC TTT GGA CAT GAA GTT TTG AAT CTT ACG 528 
Ile Gln Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
165 170 175 

CGA AAT GGT TAT GGC TCT ACT CAA TAC ATT AGA TTT AGC CCA GAT TTT 576 
Arg Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser Pro Asp Phe 
180 185 190 

ACA TTT GGT TTT GAG GAG TCA CTT GAA GTT GAT ACA AAT CCT CTT TTA 624 
Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
195 200 205 

GGT GCA GGC AAA TTT GCT ACA GAT CCA GCA GTA ACA TTA GCA CAT GAA 672 
Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
210 215 220 

CTT ATA CAT GCT GGA CAT AGA TTA TAT GGA ATA GCA ATT AAT CCA AAT 720 
Leu Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn 
225 230 235 240 

AGG GTT TTT AAA GTA AAT ACT AAT GCC TAT TAT GAA ATG AGT GGG TTA 768 
Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
245 250 255 

GAA GTA AGC TTT GAG GAA CTT AGA ACA TTT GGG GGA CAT GAT GCA AAG 816 
Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
260 265 270 

TTT ATA GAT AGT TTA CAG GAA AAC GAA TTT CGT CTA TAT TAT TAT AAT 864 
Phe Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
275 280 285 

AAG TTT AAA GAT ATA GCA AGT ACA CTT AAT AAA GCT AAA TCA ATA GTA 912 
Lys Phe Lys Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys Ser Ile Val 
290 295 300 

GGT ACT ACT GCT TCA TTA CAG TAT ATG AAA AAT GTT TTT AAA GAG AAA 960 
Gly Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys 
305 310 315 320 

TAT CTC CTA TCT GAA GAT ACA TCT GGA AAA TTT TCG GTA GAT AAA TTA 1008 
Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
325 330 335 

AAA TTT GAT AAG TTA TAC AAA ATG TTA ACA GAG ATT TAC ACA GAG GAT 1056 
Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu Ile Tyr Thr Glu Asp 
340 345 350 

AAT TTT GTT AAG TTT TTT AAA GTA CTT AAC AGA AAA ACA TAT TTG AAT 1104 
Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
355 360 365 

TTT GAT AAA GCC GTA TTT AAG ATA AAT ATA GTA CCT AAG GTA AAT TAC 1152 
Phe Asp Lys Ala Val Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr 
370 375 380 

ACA ATA TAT GAT GGA TTT AAT TTA AGA AAT ACA AAT TTA GCA GCA AAC 1200 
Thr Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 
385 390 395 400 

TTT AAT GGT CAA AAT ACA GAA ATT AAT AAT ATG AAT TTT ACT AAA CTA 1248 
Phe Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu 
405 410 415 

AAA AAT TTT ACT GGA TTG TTT GAA TTT TAT AAG TTG CTA TGT GTA AGA 1296 
Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Arg 
420 425 430 

GGG ATA ATA ACT TCT AAA ACT AAA TCA TTA GAT AAA GGA TAC AAT AAG 1344 
Gly Ile Ile Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lys 
435 440 445 

GCA TTA AAT GAT TTA TGT ATC AAA GTT AAT AAT TGG GAC TTG TTT TTT 1392 
Ala Leu Asn Asp Leu Cys Ile Lys Val Asn Asn Trp Asp Leu Phe Phe 
450 455 460 

AGT CCT TCA GAA GAT AAT TTT ACT AAT GAT CTA AAT AAA GGA GAA GAA 1440 
Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 
465 470 475 480 

ATT ACA TCT GAT ACT AAT ATA GAA GCA GCA GAA GAA AAT ATT AGT TTA 1488 
Ile Thr Ser Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu 
485 490 495 



GAT TTA ATA CAA CAA TAT TAT TTA ACC TTT AAT TTT GAT AAT GAA CCT 1536 
Asp Leu Ile Gln Gln Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 

GAA AAT ATT TCA ATA GAA AAT CTT TCA AGT GAC ATT ATA GGC CAA TTA 1584 
Glu Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln Leu 
515 520 525 

GAA CTT ATG CCT AAT ATA GAA AGA TTT CCT AAT GGA AAA AAG TAT GAG 1632 
Glu Leu Met Pro Asn Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
530 535 540 

TTA GAT AAA TAT ACT ATG TTC CAT TAT CTT CGT GCT CAA GAA TTT GAA 1680 
Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gln Glu Phe Glu 
545 550 555 560 

CAT GGT AAA TCT AGG ATT GCT TTA ACA AAT TCT GTT AAC GAA GCA TTA 1728 
His Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 575 

TTA AAT CCT AGT CGT GTT TAT ACA TTT TTT TCT TCA GAC TAT GTA AAG 1776 
Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
580 585 590 

AAA GTT AAT AAA GCT ACG GAG GCA GCT ATG TTT TTA GGC TGG GTA GAA 1824 
Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
595 600 605 

CAA TTA GTA TAT GAT TTT ACC GAT GAA ACT AGC GAA GTA AGT ACT ACG 1872 
Gln Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
610 615 620 

GAT AAA ATT GCG GAT ATA ACT ATA ATT ATT CCA TAT ATA GGA CCT GCT 1920 
Asp Lys Ile Ala Asp Ile Thr Ile Ile Ile Pro Tyr Ile Gly Pro Ala 
625 630 635 640 

TTA AAT ATA GGT AAT ATG TTA TAT AAA GAT GAT TTT GTA GGT GCT TTA 1968 
Leu Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
645 650 655 

ATA TTT TCA GGA GCT GTT ATT CTG TTA GAA TTT ATA CCA GAG ATT GCA 2016 
Ile Phe Ser Gly Ala Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala 
660 665 670 

ATA CCT GTA TTA GGT ACT TTT GCA CTT GTA TCA TAT ATT GCG AAT AAG 2064 
Ile Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr Ile Ala Asn Lys 
675 680 685 

GTT CTA ACC GTT CAA ACA ATA GAT AAT GCT TTA AGT AAA AGA AAT GAA 2112 
Val Leu Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn Glu 
690 695 700 

AAA TGG GAT GAG GTC TAT AAA TAT ATA GTA ACA AAT TGG TTA GCA AAG 2160 
Lys Trp Asp Glu Val Tyr Lys Tyr Ile Val Thr Asn Trp Leu Ala Lys 
705 710 715 720 

GTT AAT ACA CAG ATT GAT CTA ATA AGA AAA AAA ATG AAA GAA GCT TTA 2208 
Val Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu 
725 730 735 

GAA AAT CAA GCA GAA GCA ACA AAG GCT ATA ATA AAC TAT CAG TAT AAT 2256 
Glu Asn Gln Ala Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn 
740 745 750 

CAA TAT ACT GAG GAA GAG AAA AAT AAT ATT AAT TTT AAT ATT GAT GAT 2304 
Gln Tyr Thr Glu Glu Glu Lys Asn Asn Ile Asn Phe Asn Ile Asp Asp 
755 760 765 

TTA AGT TCG AAA CTT AAT GAG TCT ATA AAT AAA GCT ATG ATT AAT ATA 2352 
Leu Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile 
770 775 780 

AAT AAA TTT TTG AAT CAA TGC TCT GTT TCA TAT TTA ATG AAT TCT ATG 2400 
Asn Lys Phe Leu Asn Gln Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
785 790 795 800 

ATC CCT TAT GGT GTT AAA CGG TTA GAA GAT TTT GAT GCT AGT CTT AAA 2448 
Ile Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 
805 810 815 

GAT GCA TTA TTA AAG TAT ATA TAT GAT AAT AGA GGA ACT TTA ATT GGT 2496 
Asp Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr Leu Ile Gly 
820 825 830 

CAA GTA GAT AGA TTA AAA GAT AAA GTT AAT AAT ACA CTT AGT ACA GAT 2544 
Gln Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
835 840 845 

ATA CCT TTT CAG CTT TCC AAA TAC GTA GAT AAT CAA AGA TTA TTA TCT 2592 
Ile Pro Phe Gln Leu Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu Ser 
850 855 860 

ACA TTT ACT GAA TAT ATT AAG AAT ATT ATT AAT ACT TCT ATA TTG AAT 2640 
Thr Phe Thr Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn 
865 870 875 880 

TTA AGA TAT GAA AGT AAT CAT TTA ATA GAC TTA TCT AGG TAT GCA TCA 2688 
Leu Arg Tyr Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser 
885 890 895 

AAA ATA AAT ATT GGT AGT AAA GTA AAT TTT GAT CCA ATA GAT AAA AAT 2736 
Lys Ile Asn Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn 
900 905 910 

CAA ATT CAA TTA TTT AAT TTA GAA AGT AGT AAA ATT GAG GTA ATT TTA 2784 
Gln Ile Gln Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu 
915 920 925 

AAA AAT GCT ATT GTA TAT AAT AGT ATG TAT GAA AAT TTT AGT ACT AGC 2832 
Lys Asn Ala Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 
930 935 940 

TTT TGG ATA AGA ATT CCT AAG TAT TTT AAC AGT ATA AGT CTA AAT AAT 2880 
Phe Trp Ile Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn 
945 950 955 960 

GAA TAT ACA ATA ATA AAT TGT ATG GAA AAT AAT TCA GGA TGG AAA GTA 2928 
Glu Tyr Thr Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val 
965 970 975 

TCA CTT AAT TAT GGT GAA ATA ATC TGG ACT TTA CAG GAT ACT CAG GAA 2976 
Ser Leu Asn Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu 
980 985 990 

ATA AAA CAA AGA GTA GTT TTT AAA TAC AGT CAA ATG ATT AAT ATA TCA 3024 
Ile Lys Gln Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser 
995 1000 1005 

GAT TAT ATA AAC AGA TGG ATT TTT GTA ACT ATC ACT AAT AAT AGA TTA 3072 
Asp Tyr Ile Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu 
1010 1015 1020 

AAT AAC TCT AAA ATT TAT ATA AAT GGA AGA TTA ATA GAT CAA AAA CCA 3120 
Asn Asn Ser Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro 
1025 1030 1035 1040 

ATT TCA AAT TTA GGT AAT ATT CAT GCT AGT AAT AAT ATA ATG TTT AAA 3168 
Ile Ser Asn Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys 
1045 1050 1055 



TTA GAT GGT TGT AGA GAT ACA CAT AGA TAT ATT TGG ATA AAA TAT TTT 3216 
Leu Asp Gly Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe 
1060 1065 1070 

AAT CTT TTT GAT AAG GAA TTA AAT GAA AAA GAA ATC AAA GAT TTA TAT 3264 
Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr 
1075 1080 1085 

GAT AAT CAA TCA AAT TCA GGT ATT TTA AAA GAC TTT TGG GGT GAT TAT 3312 
Asp Asn Gln Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr 
1090 1095 1100 

TTA CAA TAT GAT AAA CCA TAC TAT ATG TTA AAT TTA TAT GAT CCA AAT 3360 
Leu Gln Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
1105 1110 1115 1120 

AAA TAT GTC GAT GTA AAT AAT GTA GGT ATT AGA GGT TAT ATG TAT CTT 3408 
Lys Tyr Val Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu 
1125 1130 1135 

AAA GGG CCT AGA GGT AGC GTA ATG ACT ACA AAC ATT TAT TTA AAT TCA 3456 
Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser 
1140 1145 1150 

AGT TTG TAT AGG GGG ACA AAA TTT ATT ATA AAA AAA TAT GCT TCT GGA 3504 
Ser Leu Tyr Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly 
1155 1160 1165 

AAT AAA GAT AAT ATT GTT AGA AAT AAT GAT CGT GTA TAT ATT AAT GTA 3552 
Asn Lys Asp Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val 
1170 1175 1180 

GTA GTT AAA AAT AAA GAA TAT AGG TTA GCT ACT AAT GCA TCA CAG GCA 3600 
Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala 
1185 1190 1195 1200 

GGC GTA GAA AAA ATA CTA AGT GCA TTA GAA ATA CCT GAT GTA GGA AAT 3648 
Gly Val Glu Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn 
1205 1210 1215 

CTA AGT CAA GTA GTA GTA ATG AAG TCA AAA AAT GAT CAA GGA ATA ACA 3696 
Leu Ser Gln Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr 
1220 1225 1230 

AAT AAA TGC AAA ATG AAT TTA CAA GAT AAT AAT GGG AAT GAT ATA GGC 3744 
Asn Lys Cys Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly 
1235 1240 1245 

TTT ATA GGA TTT CAT CAG TTT AAT AAT ATA GCT AAA CTA GTA GCA AGT 3792 
Phe Ile Gly Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser 
1250 1255 1260 

AAT TGG TAT AAT AGA CAA ATA GAA AGA TCT AGT AGG ACT TTG GGT TGC 3840 
Asn Trp Tyr Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys 
1265 1270 1275 1280 



TCA TGG GAA TTT ATT CCT GTA GAT GAT GGA TGG GGA GAA AGG CCA CTG 3888 
Ser Trp Glu Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu 
1285 1290 1295 

TAA 3891 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1296 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gln Phe Val Asn Lys Gln Phe Asn Tyr Lys Asp Pro Val Asn Gly 
1 5 10 15 

Val Asp Ile Ala Tyr Ile Lys Ile Pro Asn Val Gly Gln Met Gln Pro 
20 25 30 

Val Lys Ala Phe Lys Ile His Asn Lys Ile Trp Val Ile Pro Glu Arg 
35 40 45 



Asp Thr Phe Thr Asn Pro Glu Glu Gly Asp Leu Asn Pro Pro Pro Glu 
50 55 60 

Ala Lys Gln Val Pro Val Ser Tyr Tyr Asp Ser Thr Tyr Leu Ser Thr 
65 70 75 80 

Asp Asn Glu Lys Asp Asn Tyr Leu Lys Gly Val Thr Lys Leu Phe Glu 
85 90 95 

Arg Ile Tyr Ser Thr Asp Leu Gly Arg Met Leu Leu Thr Ser Ile Val 
100 105 110 

Arg Gly Ile Pro Phe Trp Gly Gly Ser Thr Ile Asp Thr Glu Leu Lys 
115 120 125 

Val Ile Asp Thr Asn Cys Ile Asn Val Ile Gln Pro Asp Gly Ser Tyr 
130 135 140 

Arg Ser Glu Glu Leu Asn Leu Val Ile Ile Gly Pro Ser Ala Asp Ile 
145 150 155 160 

Ile Gln Phe Glu Cys Lys Ser Phe Gly His Glu Val Leu Asn Leu Thr 
165 170 175 

Arg Asn Gly Tyr Gly Ser Thr Gln Tyr Ile Arg Phe Ser Pro Asp Phe 
180 185 190 

Thr Phe Gly Phe Glu Glu Ser Leu Glu Val Asp Thr Asn Pro Leu Leu 
195 200 205 

Gly Ala Gly Lys Phe Ala Thr Asp Pro Ala Val Thr Leu Ala His Glu 
210 215 220 

Leu Ile His Ala Gly His Arg Leu Tyr Gly Ile Ala Ile Asn Pro Asn 
225 230 235 240 

Arg Val Phe Lys Val Asn Thr Asn Ala Tyr Tyr Glu Met Ser Gly Leu 
245 250 255 

Glu Val Ser Phe Glu Glu Leu Arg Thr Phe Gly Gly His Asp Ala Lys 
260 265 270 

Phe Ile Asp Ser Leu Gln Glu Asn Glu Phe Arg Leu Tyr Tyr Tyr Asn 
275 280 285 

Lys Phe Lys Asp Ile Ala Ser Thr Leu Asn Lys Ala Lys Ser Ile Val 
290 295 300 

Gly Thr Thr Ala Ser Leu Gln Tyr Met Lys Asn Val Phe Lys Glu Lys 
305 310 315 320 

Tyr Leu Leu Ser Glu Asp Thr Ser Gly Lys Phe Ser Val Asp Lys Leu 
325 330 335 

Lys Phe Asp Lys Leu Tyr Lys Met Leu Thr Glu Ile Tyr Thr Glu Asp 
340 345 350 

Asn Phe Val Lys Phe Phe Lys Val Leu Asn Arg Lys Thr Tyr Leu Asn 
355 360 365 

Phe Asp Lys Ala Val Phe Lys Ile Asn Ile Val Pro Lys Val Asn Tyr 
370 375 380 

Thr Ile Tyr Asp Gly Phe Asn Leu Arg Asn Thr Asn Leu Ala Ala Asn 
385 390 395 400 

Phe Asn Gly Gln Asn Thr Glu Ile Asn Asn Met Asn Phe Thr Lys Leu 
405 410 415 

Lys Asn Phe Thr Gly Leu Phe Glu Phe Tyr Lys Leu Leu Cys Val Arg 
420 425 430 

Gly Ile Ile Thr Ser Lys Thr Lys Ser Leu Asp Lys Gly Tyr Asn Lys 
435 440 445 

Ala Leu Asn Asp Leu Cys Ile Lys Val Asn Asn Trp Asp Leu Phe Phe 
450 455 460 

Ser Pro Ser Glu Asp Asn Phe Thr Asn Asp Leu Asn Lys Gly Glu Glu 
465 470 475 480 

Ile Thr Ser Asp Thr Asn Ile Glu Ala Ala Glu Glu Asn Ile Ser Leu 
485 490 495 

Asp Leu Ile Gln Gln Tyr Tyr Leu Thr Phe Asn Phe Asp Asn Glu Pro 
500 505 510 

Glu Asn Ile Ser Ile Glu Asn Leu Ser Ser Asp Ile Ile Gly Gln Leu 
515 520 525 

Glu Leu Met Pro Asn Ile Glu Arg Phe Pro Asn Gly Lys Lys Tyr Glu 
530 535 540 

Leu Asp Lys Tyr Thr Met Phe His Tyr Leu Arg Ala Gln Glu Phe Glu 
545 550 555 560 

His Gly Lys Ser Arg Ile Ala Leu Thr Asn Ser Val Asn Glu Ala Leu 
565 570 575 

Leu Asn Pro Ser Arg Val Tyr Thr Phe Phe Ser Ser Asp Tyr Val Lys 
580 585 590 

Lys Val Asn Lys Ala Thr Glu Ala Ala Met Phe Leu Gly Trp Val Glu 
595 600 605 

Gln Leu Val Tyr Asp Phe Thr Asp Glu Thr Ser Glu Val Ser Thr Thr 
610 615 620 

Asp Lys Ile Ala Asp Ile Thr Ile Ile Ile Pro Tyr Ile Gly Pro Ala 
625 630 635 640 

Leu Asn Ile Gly Asn Met Leu Tyr Lys Asp Asp Phe Val Gly Ala Leu 
645 650 655 

Ile Phe Ser Gly Ala Val Ile Leu Leu Glu Phe Ile Pro Glu Ile Ala 
660 665 670 

Ile Pro Val Leu Gly Thr Phe Ala Leu Val Ser Tyr Ile Ala Asn Lys 
675 680 685 

Val Leu Thr Val Gln Thr Ile Asp Asn Ala Leu Ser Lys Arg Asn Glu 
690 695 700 

Lys Trp Asp Glu Val Tyr Lys Tyr Ile Val Thr Asn Trp Leu Ala Lys 
705 710 715 720 

Val Asn Thr Gln Ile Asp Leu Ile Arg Lys Lys Met Lys Glu Ala Leu 
725 730 735 

Glu Asn Gln Ala Glu Ala Thr Lys Ala Ile Ile Asn Tyr Gln Tyr Asn 
740 745 750 

Gln Tyr Thr Glu Glu Glu Lys Asn Asn Ile Asn Phe Asn Ile Asp Asp 
755 760 765 

Leu Ser Ser Lys Leu Asn Glu Ser Ile Asn Lys Ala Met Ile Asn Ile 
770 775 780 

Asn Lys Phe Leu Asn Gln Cys Ser Val Ser Tyr Leu Met Asn Ser Met 
785 790 795 800 

Ile Pro Tyr Gly Val Lys Arg Leu Glu Asp Phe Asp Ala Ser Leu Lys 
805 810 815 

Asp Ala Leu Leu Lys Tyr Ile Tyr Asp Asn Arg Gly Thr Leu Ile Gly 
820 825 830 

Gln Val Asp Arg Leu Lys Asp Lys Val Asn Asn Thr Leu Ser Thr Asp 
835 840 845 

Ile Pro Phe Gln Leu Ser Lys Tyr Val Asp Asn Gln Arg Leu Leu Ser 
850 855 860 

Thr Phe Thr Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn 
865 870 875 880 

Leu Arg Tyr Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser 
885 890 895 

Lys Ile Asn Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn 
900 905 910 

Gln Ile Gln Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu 
915 920 925 

Lys Asn Ala Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser 
930 935 940 

Phe Trp Ile Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn 
945 950 955 960 

Glu Tyr Thr Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val 
965 970 975 

Ser Leu Asn Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu 
980 985 990 

Ile Lys Gln Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser 
995 1000 1005 

Asp Tyr Ile Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu 
1010 1015 1020 

Asn Asn Ser Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro 
1025 1030 1035 1040 

Ile Ser Asn Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys 
1045 1050 1055 

Leu Asp Gly Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe 
1060 1065 1070 

Asn Leu Phe Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr 
1075 1080 1085 

Asp Asn Gln Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr 
1090 1095 1100 

Leu Gln Tyr Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn 
1105 1110 1115 1120 

Lys Tyr Val Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu 
1125 1130 1135 

Lys Gly Pro Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser 
1140 1145 1150 

Ser Leu Tyr Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly 
1155 1160 1165 






Asn Lys Asp Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val
1170 1175 1180

Val Val Lys Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala
1185 1190 1195 1200

Gly Val Glu Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn
1205 1210 1215

Leu Ser Gln Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr
1220 1225 1230

Asn Lys Cys Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly
1235 1240 1245

Phe Ile Gly Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser
1250 1255 1260

Asn Trp Tyr Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys
1265 1270 1275 1280

Ser Trp Glu Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu
1285 1290 1295

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CGCCATGGCT AGATTATTAT CTACATTTAC 30

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

GCAAGCTTCT TGACAGACTC ATGTAG 26
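
The two primers above (SEQ ID NOS: 29 and 30) can be checked mechanically. The following Python sketch is an illustration only: it confirms the stated lengths and reports where two common restriction sites occur. The enzyme names and recognition sequences in `SITES` (NcoI, CCATGG; HindIII, AAGCTT) are standard values supplied here as an assumption; this part of the listing does not name the enzymes.

```python
# Primer sequences copied from SEQ ID NOS: 29 and 30 (spaces removed).
PRIMERS = {
    "SEQ ID NO:29": "CGCCATGGCTAGATTATTATCTACATTTAC",
    "SEQ ID NO:30": "GCAAGCTTCTTGACAGACTCATGTAG",
}

# Assumed recognition sites (standard values, not stated in the listing).
SITES = {"NcoI": "CCATGG", "HindIII": "AAGCTT"}


def find_sites(seq):
    """Return {enzyme: 0-based start index} for each site present in seq."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


def summarize():
    """Return {primer name: (length, sites found)} for every primer."""
    return {name: (len(seq), find_sites(seq)) for name, seq in PRIMERS.items()}


if __name__ == "__main__":
    for name, (length, sites) in summarize().items():
        print(name, length, sites)
```

Under these assumptions, each primer carries one of the two sites near its 5' end, which would be consistent with directional cloning of the amplified fragment.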
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1546 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60
TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACCATG GGCCATCATC 120
ATCATCATCA TCATCATCAT CACAGCAGCG GCCATATCGA AGGTCGTCAT ATGGCTAGCA 180
TGGCTAGATT ATTATCTACA TTTACTGAAT ATATTAAGAA TATTATTAAT ACTTCTATAT 240
TGAATTTAAG ATATGAAAGT AATCATTTAA TAGACTTATC TAGGTATGCA TCAAAAATAA 300
ATATTGGTAG TAAAGTAAAT TTTGATCCAA TAGATAAAAA TCAAATTCAA TTATTTAATT 360
TAGAAAGTAG TAAAATTGAG GTAATTTTAA AAAATGCTAT TGTATATAAT AGTATGTATG 420
AAAATTTTAG TACTAGCTTT TGGATAAGAA TTCCTAAGTA TTTTAACAGT ATAAGTCTAA 480
ATAATGAATA TACAATAATA AATTGTATGG AAAATAATTC AGGATGGAAA GTATCACTTA 540
ATTATGGTGA AATAATCTGG ACTTTACAGG ATACTCAGGA AATAAAACAA AGAGTAGTTT 600
TTAAATACAG TCAAATGATT AATATATCAG ATTATATAAA CAGATGGATT TTTGTAACTA 660
TCACTAATAA TAGATTAAAT AACTCTAAAA TTTATATAAA TGGAAGATTA ATAGATCAAA 720
AACCAATTTC AAATTTAGGT AATATTCATG CTAGTAATAA TATAATGTTT AAATTAGATG 780
GTTGTAGAGA TACACATAGA TATATTTGGA TAAAATATTT TAATCTTTTT GATAAGGAAT 840
TAAATGAAAA AGAAATCAAA GATTTATATG ATAATCAATC AAATTCAGGT ATTTTAAAAG 900
ACTTTTGGGG TGATTATTTA CAATATGATA AACCATACTA TATGTTAAAT TTATATGATC 960
CAAATAAATA TGTCGATGTA AATAATGTAG GTATTAGAGG TTATATGTAT CTTAAAGGGC 1020
CTAGAGGTAG CGTAATGACT ACAAACATTT ATTTAAATTC AAGTTTGTAT AGGGGGACAA 1080
AATTTATTAT AAAAAAATAT GCTTCTGGAA ATAAAGATAA TATTGTTAGA AATAATGATC 1140
GTGTATATAT TAATGTAGTA GTTAAAAATA AAGAATATAG GTTAGCTACT AATGCATCAC 1200
AGGCAGGCGT AGAAAAAATA CTAAGTGCAT TAGAAATACC TGATGTAGGA AATCTAAGTC 1260
AAGTAGTAGT AATGAAGTCA AAAAATGATC AAGGAATAAC AAATAAATGC AAAATGAATT 1320
TACAAGATAA TAATGGGAAT GATATAGGCT TTATAGGATT TCATCAGTTT AATAATATAG 1380
CTAAACTAGT AGCAAGTAAT TGGTATAATA GACAAATAGA AAGATCTAGT AGGACTTTGG 1440
GTTGCTCATG GGAATTTATT CCTGTAGATG ATGGATGGGG AGAAAGGCCA CTGTAATTAA 1500
TCTCAAACTA CATGAGTCTG TCAAGAAGCT TGCGGCCGCA CTCGAG 1546

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Met His His His His His His Met Ala
1 5

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

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(ii) MOLECULE TYPE: Other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

TATGCATCAC CATCACCATC A 21

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

CATGTGATGG TGATGGTGAT GCA 23
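
The oligonucleotides of SEQ ID NOS: 33 and 34 appear to be complementary over most of their length. The Python sketch below, offered as an illustration rather than as the method used in the specification, finds the longest stretch of SEQ ID NO: 33 that can base-pair with SEQ ID NO: 34. The function names are hypothetical. If the two strands are annealed as this suggests, the unpaired 5' extensions would be TA on SEQ ID NO: 33 and CATG on SEQ ID NO: 34; reading these as cohesive ends for cloning is an assumption, not something stated in this part of the listing.

```python
# Oligonucleotides copied from SEQ ID NOS: 33 and 34 (5' to 3', spaces removed).
SEQ33 = "TATGCATCACCATCACCATCA"
SEQ34 = "CATGTGATGGTGATGGTGATGCA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def duplex_region(top, bottom):
    """Longest stretch of `top` that base-pairs with `bottom` (antiparallel)."""
    target = reverse_complement(bottom)
    best = ""
    for start in range(len(top)):
        # Only try substrings longer than the best match found so far.
        for end in range(start + len(best) + 1, len(top) + 1):
            if top[start:end] in target:
                best = top[start:end]
            else:
                break  # a longer substring from this start cannot match either
    return best


if __name__ == "__main__":
    paired = duplex_region(SEQ33, SEQ34)
    print(len(paired), "paired bases starting at index", SEQ33.index(paired))
```

The paired region begins at the third base of SEQ ID NO: 33, and the top strand from its second base onward reads ATG CAT CAC CAT CAC CAT CA, which agrees with the Met-His6 start of the peptide in SEQ ID NO: 32.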

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1351 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 1..1335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATG CAT CAC CAT CAC CAT CAC ATG GCT CGT CTG CTG TCT ACC TTC ACT 48
Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr
1 5 10 15

GAA TAC ATC AAG AAC ATC ATC AAT ACC TCC ATC CTG AAC CTG CGC TAC 96
Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn Leu Arg Tyr
20 25 30

GAA TCC AAT CAC CTG ATC GAC CTG TCT CGC TAC GCT TCC AAA ATC AAC 144
Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser Lys Ile Asn
35 40 45

ATC GGT TCT AAA GTT AAC TTC GAT CCG ATC GAC AAG AAT CAG ATC CAG 192
Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn Gln Ile Gln
50 55 60

CTG TTC AAT CTG GAA TCT TCC AAA ATC GAA GTT ATC CTG AAG AAT GCT 240
Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu Lys Asn Ala
65 70 75 80

ATC GTA TAC AAC TCT ATG TAC GAA AAC TTC TCC ACC TCC TTC TGG ATC 288
Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp Ile
85 90 95

CGT ATC CCG AAA TAC TTC AAC TCC ATC TCT CTG AAC AAT GAA TAC ACC 336
Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn Glu Tyr Thr
100 105 110

ATC ATC AAC TGC ATG GAA AAC AAT TCT GGT TGG AAA GTA TCT CTG AAC 384
Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn
115 120 125



[...] ATC AAA CAG 432
Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu Ile Lys Gln
130 135 140

CGT GTT GTA TTC AAA TAC TCT CAG ATG ATC AAC ATC TCT GAC TAC ATC 480
Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser Asp Tyr Ile
145 150 155 160

AAT CGC TGG ATG TTC GTT ACC ATC ACC AAC AAT CGT CTG AAT AAC TCC 528
Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu Asn Asn Ser
165 170 175

AAA ATC TAC ATC AAC GGC CGT CTG ATC GAC CAG AAA CCG ATC TCC AAT 576
Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro Ile Ser Asn
180 185 190

CTG GGT AAC ATC CAC GCT TCT AAT AAC ATC ATG TTC AAA CTG GAC GGT 624
Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys Leu Asp Gly
195 200 205

TGT CGT GAC ACT CAC CGC TAC ATC TGG ATC AAA TAC TTC AAT CTG TTC 672
Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe Asn Leu Phe
210 215 220

GAC AAA GAA CTG AAC GAA AAA GAA ATC AAA GAC CTG TAC GAC AAC CAG 720
Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr Asp Asn Gln
225 230 235 240

[...] TAC CTG CAG TAC 768
Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gln Tyr
245 250 255

GAC AAA CCG TAC TAC ATG CTG AAT CTG TAC GAT CCG AAC AAA TAC GTT 816
Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val
260 265 270

GAC GTC AAC AAT GTA GGT ATC CGC GGT TAC ATG TAC CTG AAA GGT CCG 864
Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu Lys Gly Pro
275 280 285

CGT GGT TCT GTT ATG ACT ACC AAC ATC TAC CTG AAC TCT TCC CTG TAC 912
Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser Ser Leu Tyr
290 295 300

CGT GGT ACC AAA TTC ATC ATC AAG AAA TAC GCG TCT GGT AAC AAG GAC 960
Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly Asn Lys Asp
305 310 315 320

AAT ATC GTT CGC AAC AAT GAT CGT GTA TAC ATC AAT GTT GTA GTT AAG 1008
Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val Val Val Lys
325 330 335

AAC AAA GAA TAC CGT CTG GCT ACC AAT GCT TCT CAG GCT GGT GTA GAA 1056
Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala Gly Val Glu
340 345 350

AAG ATC TTG TCT GCT CTG GAA ATC CCG GAC GTT GGT AAT CTG TCT CAG 1104
Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn Leu Ser Gln
355 360 365

GTA GTT GTA ATG AAA TCC AAG AAC GAC CAG GGT ATC ACT AAC AAA TGC 1152
Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr Asn Lys Cys
370 375 380

AAA ATG AAT CTG CAG GAC AAC AAT GGT AAC GAT ATC GGT TTC ATC GGT 1200
Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly Phe Ile Gly
385 390 395 400

TTC CAC CAG TTC AAC AAT ATC GCT AAA CTG GTT GCT TCC AAC TGG TAC 1248
Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser Asn Trp Tyr
405 410 415

AAT CGT CAG ATC GAA CGT TCC TCT CGC ACT CTG GGT TGC TCT TGG GAG 1296
Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp Glu
420 425 430

TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG TAACCCGGGA 1345
Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu
435 440 445

AAGCTT 1351
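
The coding region of SEQ ID NO: 35 can be spot-checked against its listed translation. The Python sketch below is an illustration, not part of the specification: it builds the standard genetic code (an assumption about the code intended, although it is the usual one for E. coli expression) and translates a row of codons into three-letter residues. `translate`, `FIRST_ROW` and `LAST_ROW` are hypothetical names; the two rows are copied from the listing above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into three-letter residues."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons)


# First and last coding rows of SEQ ID NO: 35, as listed.
FIRST_ROW = "ATG CAT CAC CAT CAC CAT CAC ATG GCT CGT CTG CTG TCT ACC TTC ACT"
LAST_ROW = "TTC ATC CCG GTT GAT GAC GGT TGG GGT GAA CGT CCG CTG TAA"

if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(translate(LAST_ROW))
```

The stated CDS location 1..1335 corresponds to 445 codons, which matches the 445-residue protein of SEQ ID NO: 36; the TAA that follows is the stop codon.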

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 445 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Met His His His His His His Met Ala Arg Leu Leu Ser Thr Phe Thr
1 5 10 15

Glu Tyr Ile Lys Asn Ile Ile Asn Thr Ser Ile Leu Asn Leu Arg Tyr
20 25 30

Glu Ser Asn His Leu Ile Asp Leu Ser Arg Tyr Ala Ser Lys Ile Asn
35 40 45

Ile Gly Ser Lys Val Asn Phe Asp Pro Ile Asp Lys Asn Gln Ile Gln
50 55 60

Leu Phe Asn Leu Glu Ser Ser Lys Ile Glu Val Ile Leu Lys Asn Ala
65 70 75 80

Ile Val Tyr Asn Ser Met Tyr Glu Asn Phe Ser Thr Ser Phe Trp Ile
85 90 95

Arg Ile Pro Lys Tyr Phe Asn Ser Ile Ser Leu Asn Asn Glu Tyr Thr
100 105 110

Ile Ile Asn Cys Met Glu Asn Asn Ser Gly Trp Lys Val Ser Leu Asn
115 120 125

Tyr Gly Glu Ile Ile Trp Thr Leu Gln Asp Thr Gln Glu Ile Lys Gln
130 135 140

Arg Val Val Phe Lys Tyr Ser Gln Met Ile Asn Ile Ser Asp Tyr Ile
145 150 155 160

Asn Arg Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu Asn Asn Ser
165 170 175

Lys Ile Tyr Ile Asn Gly Arg Leu Ile Asp Gln Lys Pro Ile Ser Asn
180 185 190

Leu Gly Asn Ile His Ala Ser Asn Asn Ile Met Phe Lys Leu Asp Gly
195 200 205

Cys Arg Asp Thr His Arg Tyr Ile Trp Ile Lys Tyr Phe Asn Leu Phe
210 215 220

Asp Lys Glu Leu Asn Glu Lys Glu Ile Lys Asp Leu Tyr Asp Asn Gln
225 230 235 240

Ser Asn Ser Gly Ile Leu Lys Asp Phe Trp Gly Asp Tyr Leu Gln Tyr
245 250 255

Asp Lys Pro Tyr Tyr Met Leu Asn Leu Tyr Asp Pro Asn Lys Tyr Val
260 265 270

Asp Val Asn Asn Val Gly Ile Arg Gly Tyr Met Tyr Leu Lys Gly Pro
275 280 285

Arg Gly Ser Val Met Thr Thr Asn Ile Tyr Leu Asn Ser Ser Leu Tyr
290 295 300

Arg Gly Thr Lys Phe Ile Ile Lys Lys Tyr Ala Ser Gly Asn Lys Asp
305 310 315 320

Asn Ile Val Arg Asn Asn Asp Arg Val Tyr Ile Asn Val Val Val Lys
325 330 335

Asn Lys Glu Tyr Arg Leu Ala Thr Asn Ala Ser Gln Ala Gly Val Glu
340 345 350

Lys Ile Leu Ser Ala Leu Glu Ile Pro Asp Val Gly Asn Leu Ser Gln
355 360 365

Val Val Val Met Lys Ser Lys Asn Asp Gln Gly Ile Thr Asn Lys Cys
370 375 380

Lys Met Asn Leu Gln Asp Asn Asn Gly Asn Asp Ile Gly Phe Ile Gly
385 390 395 400

Phe His Gln Phe Asn Asn Ile Ala Lys Leu Val Ala Ser Asn Trp Tyr
405 410 415

Asn Arg Gln Ile Glu Arg Ser Ser Arg Thr Leu Gly Cys Ser Trp Glu
420 425 430

Phe Ile Pro Val Asp Asp Gly Trp Gly Glu Arg Pro Leu
435 440 445

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

CGCATATGAA TATTCGTCCA TTGCATG 27

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

GGAAGCTTGC AGGGCAATTA CATCATG 27

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3876 base pairs
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225 230 



235 240 






GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 768
Gly Ile Lys Val Asp Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe
245 250 255

TTT ATG CAA TCT ACA GAT ACT ATA CAG GCA GAA GAA CTA TAT ACA TTT 816
Phe Met Gln Ser Thr Asp Thr Ile Gln Ala Glu Glu Leu Tyr Thr Phe
260 265 270

GGA GGA CAA GAT CCC AGC ATC ATA TCT CCT TCT ACA GAT AAA AGT ATC 864
Gly Gly Gln Asp Pro Ser Ile Ile Ser Pro Ser Thr Asp Lys Ser Ile
275 280 285

TAT GAT AAA GTT TTG CAA AAT TTT AGG GGG ATA GTT GAT AGA CTT AAC 912
Tyr Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp Arg Leu Asn
290 295 300

AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAC ATT AAT ATA TAT 960
Lys Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
305 310 315 320

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAA GGA 1008
Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly
325 330 335

AAA TAT AGT ATA GAT GTA GAA AGT TTC AAT AAA TTA TAT AAA AGC TTA 1056
Lys Tyr Ser Ile Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys Ser Leu
340 345 350

ATG TTA GGT TTT ACA GAA ATT AAT ATA GCA GAA AAT TAT AAA ATA AAA 1104
Met Leu Gly Phe Thr Glu Ile Asn Ile Ala Glu Asn Tyr Lys Ile Lys
355 360 365

ACT AGA GCT TCT TAT TTT AGT GAT TCC TTA CCA CCA GTA AAA ATA AAA 1152
Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys
370 375 380

AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 1200
Asn Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile
385 390 395 400

TCT GAT AAA AAT ATG GGA AAA GAA TAT AGG GGT CAG AAT AAA GCT ATA 1248
Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile
405 410 415

AAT AAA CAA GCT TAT GAA GAA ATC AGC AAG GAG CAT TTG GCT GTA TAT 1296
Asn Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu His Leu Ala Val Tyr
420 425 430

AAG ATA CAA ATG TGT AAA AGT GTT AAA GTT CCA GGA ATA TGT ATT GAT 1344
Lys Ile Gln Met Cys Lys Ser Val Lys Val Pro Gly Ile Cys Ile Asp
435 440 445

GTC GAT AAT GAA AAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 1392
Val Asp Asn Glu Asn Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser
450 455 460

GAT GAT TTA TCT AAA AAT GAA AGA GTA GAA TAT AAT ACA CAG AAT AAT 1440
Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gln Asn Asn
465 470 475 480

TAT ATA GGA AAT GAC TTT CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 1488
Tyr Ile Gly Asn Asp Phe Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp
485 490 495

TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 1536
Leu Ile Ser Lys Ile Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
500 505 510

GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 1584
Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
515 520 525

AAA GTT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 1632
Lys Val Phe Thr Asp Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln
530 535 540

ACA TTT CCT CTA AAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 1680
Thr Phe Pro Leu Asn Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp
545 550 555 560

GAT GCA TTA TTA GTT TCT AGC AAA GTT TAT TCA TTT TTT TCT ATG GAT 1728
Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp
565 570 575

TAT ATT AAA ACT GCT AAT AAA GTA GTA GAA GCA GGA TTA TTT GCA GGT 1776
Tyr Ile Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly
580 585 590

TGG GTG AAA CAG ATA GTA GAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 1824
Trp Val Lys Gln Ile Val Asp Asp Phe Val Ile Glu Ala Asn Lys Ser
595 600 605

AGT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 1872
Ser Thr Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
610 615 620

GGA TTA GCT TTA AAT GTA GGA GAT GAA ACA GCT AAA GGA AAT TTT GAA 1920
Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu
625 630 635 640

AGT GCT TTT GAG ATT GCA GGA TCC AGT ATT TTA CTA GAA TTT ATA CCA 1968
Ser Ala Phe Glu Ile Ala Gly Ser Ser Ile Leu Leu Glu Phe Ile Pro
645 650 655

GAA CTT TTA ATA CCT GTA GTT GGA GTC TTT TTA TTA GAA TCA TAT ATT 2016
Glu Leu Leu Ile Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr Ile
660 665 670

GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 2064
Asp Asn Lys Asn Lys Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys
675 680 685

AGA GTG GAA AAA TGG ATT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 2112
Arg Val Glu Lys Trp Ile Asp Met Tyr Gly Leu Ile Val Ala Gln Trp
690 695 700

CTC TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 2160
Leu Ser Thr Val Asn Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr
705 710 715 720

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 2208
Lys Ala Leu Asn Tyr Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr
725 730 735

AAA TAT AAT ATA TAT TCT GAA GAG GAA AAG TCA AAT ATT AAC ATC AAT 2256
Lys Tyr Asn Ile Tyr Ser Glu Glu Glu Lys Ser Asn Ile Asn Ile Asn
740 745 750

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAT GGT ATT AAC CAA GCT ATG 2304
Phe Asn Asp Ile Asn Ser Lys Leu Asn Asp Gly Ile Asn Gln Ala Met
755 760 765

GAT AAT ATA AAT GAT TTT ATA AAT GAA TGT TCT GTA TCA TAT TTA ATG 2352
Asp Asn Ile Asn Asp Phe Ile Asn Glu Cys Ser Val Ser Tyr Leu Met
770 775 780

AAA AAA ATG ATT CCA TTA GCT GTA AAA AAA TTA CTA GAC TTT GAT AAT 2400
Lys Lys Met Ile Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn
785 790 795 800

ACT CTC AAA AAA AAT TTA TTA AAT TAT ATA GAT GAA AAT AAA TTA TAT 2448
Thr Leu Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr
805 810 815

TTA ATT GGA AGT GTA GAA GAT GAA AAA TCA AAA GTA GAT AAA TAC TTG 2496
Leu Ile Gly Ser Val Glu Asp Glu Lys Ser Lys Val Asp Lys Tyr Leu
820 825 830

AAA ACC ATT ATA CCA TTT GAT CTT TCA ACG TAT TCT AAT ATT GAA ATA 2544
Lys Thr Ile Ile Pro Phe Asp Leu Ser Thr Tyr Ser Asn Ile Glu Ile
835 840 845

CTA ATA AAA ATA TTT AAT AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT 2592
Leu Ile Lys Ile Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile
850 855 860

ATC TTA AAT TTA AGA TAT AGA GAT AAT AAT TTA ATA GAT TTA TCA GGA 2640
Ile Leu Asn Leu Arg Tyr Arg Asp Asn Asn Leu Ile Asp Leu Ser Gly
865 870 875 880

TAT GGA GCA AAG GTA GAG GTA TAT GAT GGG GTC AAG CTT AAT GAT AAA 2688
Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys
885 890 895

AAT CAA TTT AAA TTA ACT AGT TCA GCA GAT AGT AAG ATT AGA GTC ACT 2736
Asn Gln Phe Lys Leu Thr Ser Ser Ala Asp Ser Lys Ile Arg Val Thr
900 905 910

CAA AAT CAG AAT ATT ATA TTT AAT AGT ATG TTC CTT GAT TTT AGC GTT 2784
Gln Asn Gln Asn Ile Ile Phe Asn Ser Met Phe Leu Asp Phe Ser Val
915 920 925

AGC TTT TGG ATA AGG ATA CCT AAA TAT AGG AAT GAT GAT ATA CAA AAT 2832
Ser Phe Trp Ile Arg Ile Pro Lys Tyr Arg Asn Asp Asp Ile Gln Asn
930 935 940

TAT ATT CAT AAT GAA TAT ACG ATA ATT AAT TGT ATG AAA AAT AAT TCA 2880
Tyr Ile His Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser
945 950 955 960

GGC TGG AAA ATA TCT ATT AGG GGT AAT AGG ATA ATA TGG ACC TTA ATT 2928
Gly Trp Lys Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile
965 970 975

GAT ATA AAT GGA AAA ACC AAA TCA GTA TTT TTT GAA TAT AAC ATA AGA 2976
Asp Ile Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg
980 985 990

GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT 3024
Glu Asp Ile Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr
995 1000 1005

AAT AAT TTG GAT AAT GCT AAA ATT TAT ATT AAT GGC ACG TTA GAA TCA 3072
Asn Asn Leu Asp Asn Ala Lys Ile Tyr Ile Asn Gly Thr Leu Glu Ser
1010 1015 1020

AAT ATG GAT ATT AAA GAT ATA GGA GAA GTT ATT GTT AAT GGT GAA ATA 3120
Asn Met Asp Ile Lys Asp Ile Gly Glu Val Ile Val Asn Gly Glu Ile
1025 1030 1035 1040



ACA TTT AAA TTA GAT GGT GAT GTA GAT AGA ACA CAA TTT ATT TGG ATG 3168
Thr Phe Lys Leu Asp Gly Asp Val Asp Arg Thr Gln Phe Ile Trp Met
1045 1050 1055

AAA TAT TTT AGT ATT TTT AAT ACG CAA TTA AAT CAA TCA AAT ATT AAA 3216
Lys Tyr Phe Ser Ile Phe Asn Thr Gln Leu Asn Gln Ser Asn Ile Lys
1060 1065 1070

GAG ATA TAT AAA ATT CAA TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG 3264
Glu Ile Tyr Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp
1075 1080 1085

GGA AAT CCT TTA ATG TAT AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG 3312
Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly
1090 1095 1100

AAT AAA AAT TCA TAT ATT AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA 3360
Asn Lys Asn Ser Tyr Ile Lys Leu Val Lys Asp Ser Ser Val Gly Glu
1105 1110 1115 1120

ATA TTA ATA CGT AGC AAA TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT 3408
Ile Leu Ile Arg Ser Lys Tyr Asn Gln Asn Ser Asn Tyr Ile Asn Tyr
1125 1130 1135

AGA AAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT 3456
Arg Asn Leu Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Glu Ser Asn
1140 1145 1150

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT 3504
Ser Gln Ser Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile His
1155 1160 1165

CTA GAT TTG GTA CTT CAC CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA 3552
Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys
1170 1175 1180

TAT TTT AAG GAA CAG GAA GAA AAA TTG TTT TTA TCT ATT ATA ACT GAT 3600
Tyr Phe Lys Glu Gln Glu Glu Lys Leu Phe Leu Ser Ile Ile Ser Asp
1185 1190 1195 1200

TCT AAT GAA TTT TAT AAG ACT ATA GAA ATA AAA GAA TAT GAT GAA CAG 3648
Ser Asn Glu Phe Tyr Lys Thr Ile Glu Ile Lys Glu Tyr Asp Glu Gln
1205 1210 1215

CCA TCA TAT AGT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT 3696
Pro Ser Tyr Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr
1220 1225 1230

GAT GAT ATA GGA TTG ATT GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT 3744
Asp Asp Ile Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Val
1235 1240 1245

TTA CGT [...] AAG TAT AAA GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 3792
Leu Arg Lys Lys Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu
1250 1255 1260

AAA GAG GTA AAA AGG AAA CCA TAT AAG TCA AAT TTG GGA TGT AAT TGG 3840
Lys Glu Val Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp
1265 1270 1275 1280

CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA 3876
Gln Phe Ile Pro Lys Asp Glu Gly Trp Thr Glu
1285 1290

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1291 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Met Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp Asn
1 5 10 15

Asp Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg
20 25 30

Tyr Tyr Lys Ala Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu
35 40 45

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly
50 55 60

Ile Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn
65 70 75 80

Thr Asn Asp Lys Lys Asn Ile Phe Phe Gln Thr Leu Ile Lys Leu Phe
85 90 95

Asn Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile
100 105 110



Ile Asn Gly Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu
115 120 125

Phe Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn
130 135 140

Pro Gly Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile
145 150 155 160

Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile Asp Ile Gly
165 170 175

Ile Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln
180 185 190

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln Glu
195 200 205

Asn Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro
210 215 220

Ala Leu Ile Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr
225 230 235 240

Gly Ile Lys Val Asp Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe
245 250 255

Phe Met Gln Ser Thr Asp Thr Ile Gln Ala Glu Glu Leu Tyr Thr Phe
260 265 270

Gly Gly Gln Asp Pro Ser Ile Ile Ser Pro Ser Thr Asp Lys Ser Ile
275 280 285

Tyr Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp Arg Leu Asn
290 295 300

Lys Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
305 310 315 320

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly
325 330 335

Lys Tyr Ser Ile Asp Val Glu Ser Phe Asn Lys Leu Tyr Lys Ser Leu
340 345 350

Met Leu Gly Phe Thr Glu Ile Asn Ile Ala Glu Asn Tyr Lys Ile Lys
355 360 365

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys
370 375 380

Asn Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile
385 390 395 400

Ser Asp Lys Asn Met Gly Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile
405 410 415

Asn Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu His Leu Ala Val Tyr
420 425 430

Lys Ile Gln Met Cys Lys Ser Val Lys Val Pro Gly Ile Cys Ile Asp
435 440 445

Val Asp Asn Glu Asn Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser
450 455 460

Asp Asp Leu Ser Lys Asn Glu Arg Val Glu Tyr Asn Thr Gln Asn Asn
465 470 475 480

Tyr Ile Gly Asn Asp Phe Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp
485 490 495

Leu Ile Ser Lys Ile Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
500 505 510

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
515 520 525

Lys Val Phe Thr Asp Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln
530 535 540

Thr Phe Pro Leu Asn Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp
545 550 555 560



Asp Ala Leu Leu Val Ser Ser Lys Val Tyr Ser Phe Phe Ser Met Asp
565 570 575

Tyr Ile Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly
580 585 590

Trp Val Lys Gln Ile Val Asp Asp Phe Val Ile Glu Ala Asn Lys Ser
595 600 605

Ser Thr Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
610 615 620

Gly Leu Ala Leu Asn Val Gly Asp Glu Thr Ala Lys Gly Asn Phe Glu
625 630 635 640

Ser Ala Phe Glu Ile Ala Gly Ser Ser Ile Leu Leu Glu Phe Ile Pro
645 650 655

Glu Leu Leu Ile Pro Val Val Gly Val Phe Leu Leu Glu Ser Tyr Ile
660 665 670

Asp Asn Lys Asn Lys Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys
675 680 685

Arg Val Glu Lys Trp Ile Asp Met Tyr Gly Leu Ile Val Ala Gln Trp
690 695 700

Leu Ser Thr Val Asn Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr
705 710 715 720

Lys Ala Leu Asn Tyr Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr
725 730 735

Lys Tyr Asn Ile Tyr Ser Glu Glu Glu Lys Ser Asn Ile Asn Ile Asn
740 745 750

Phe Asn Asp Ile Asn Ser Lys Leu Asn Asp Gly Ile Asn Gln Ala Met
755 760 765

Asp Asn Ile Asn Asp Phe Ile Asn Glu Cys Ser Val Ser Tyr Leu Met
770 775 780

Lys Lys Met Ile Pro Leu Ala Val Lys Lys Leu Leu Asp Phe Asp Asn
785 790 795 800

Thr Leu Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr
805 810 815

Leu Ile Gly Ser Val Glu Asp Glu Lys Ser Lys Val Asp Lys Tyr Leu
820 825 830

Lys Thr Ile Ile Pro Phe Asp Leu Ser Thr Tyr Ser Asn Ile Glu Ile
835 840 845

Leu Ile Lys Ile Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile
850 855 860

Ile Leu Asn Leu Arg Tyr Arg Asp Asn Asn Leu Ile Asp Leu Ser Gly
865 870 875 880

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys
885 890 895

Asn Gln Phe Lys Leu Thr Ser Ser Ala Asp Ser Lys Ile Arg Val Thr
900 905 910

Gln Asn Gln Asn Ile Ile Phe Asn Ser Met Phe Leu Asp Phe Ser Val
915 920 925

Ser Phe Trp Ile Arg Ile Pro Lys Tyr Arg Asn Asp Asp Ile Gln Asn
930 935 940

Tyr Ile His Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser
945 950 955 960

Gly Trp Lys Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile
965 970 975

Asp Ile Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg
980 985 990

Glu Asp Ile Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr
995 1000 1005

Asn Asn Leu Asp Asn Ala Lys Ile Tyr Ile Asn Gly Thr Leu Glu Ser
1010 1015 1020

Asn Met Asp Ile Lys Asp Ile Gly Glu Val Ile Val Asn Gly Glu Ile
1025 1030 1035 1040

Thr Phe Lys Leu Asp Gly Asp Val Asp Arg Thr Gln Phe Ile Trp Met
1045 1050 1055

Lys Tyr Phe Ser Ile Phe Asn Thr Gln Leu Asn Gln Ser Asn Ile Lys
1060 1065 1070

Glu Ile Tyr Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp
1075 1080 1085

Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly
1090 1095 1100

Asn Lys Asn Ser Tyr Ile Lys Leu Val Lys Asp Ser Ser Val Gly Glu
1105 1110 1115 1120

Ile Leu Ile Arg Ser Lys Tyr Asn Gln Asn Ser Asn Tyr Ile Asn Tyr
1125 1130 1135

Arg Asn Leu Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Glu Ser Asn
1140 1145 1150

Ser Gln Ser Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile His
1155 1160 1165

Leu Asp Leu Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys
1170 1175 1180

Tyr Phe Lys Glu Gln Glu Glu Lys Leu Phe Leu Ser Ile Ile Ser Asp
1185 1190 1195 1200

Ser Asn Glu Phe Tyr Lys Thr Ile Glu Ile Lys Glu Tyr Asp Glu Gln
1205 1210 1215

Pro Ser Tyr Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr
1220 1225 1230

Asp Asp Ile Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Val
1235 1240 1245

Leu Arg Lys Lys Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu
1250 1255 1260

Lys Glu Val Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp
1265 1270 1275 1280

Gln Phe Ile Pro Lys Asp Glu Gly Trp Thr Glu
1285 1290

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3873

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

ATG CCA GTT ACA ATA AAT AAT TTT AAT TAT AAT GAT CCT ATT GAT AAT 48
Met Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp Asn
1 5 10 15

AAT AAT ATT ATT ATG ATG GAG CCT CCA TTT GCG AGA GGT ACG GGG AGA 96
Asn Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg
20 25 30

TAT TAT AAA GCT TTT AAA ATC ACA GAT CGT ATT TGG ATA ATA CCG GAA 144
Tyr Tyr Lys Ala Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu
35 40 45

AGA TAT ACT TTT GGA TAT AAA CCT GAG GAT TTT AAT AAA AGT TCC GGT 192
Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly
50 55 60

ATT TTT AAT AGA GAT GTT TGT GAA TAT TAT GAT CCA GAT TAC TTA AAT 240
Ile Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn
65 70 75 80

ACT AAT GAT AAA AAG AAT ATA TTT TTA CAA ACA ATG ATC AAG TTA TTT 288
Thr Asn Asp Lys Lys Asn Ile Phe Leu Gln Thr Met Ile Lys Leu Phe
85 90 95

AAT AGA ATC AAA TCA AAA CCA TTG GGT GAA AAG TTA TTA GAG ATG ATT 336
Asn Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile
100 105 110

ATA AAT GGT ATA CCT TAT CTT GGA GAT AGA CGT GTT CCA CTC GAA GAG 384
Ile Asn Gly Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu
115 120 125

TTT AAC ACA AAC ATT GCT AGT GTA ACT GTT AAT AAA TTA ATC AGT AAT 432
Phe Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn
130 135 140

CCA GGA GAA GTG GAG CGA AAA AAA GGT ATT TTC GCA AAT TTA ATA ATA 480
Pro Gly Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile
145 150 155 160

TTT GGA CCT GGG CCA GTT TTA AAT GAA AAT GAG ACT ATA GAT ATA GGT 528
Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile Asp Ile Gly
165 170 175

ATA CAA AAT CAT TTT GCA TCA AGG GAA GGC TTC GGG GGT ATA ATG CAA 576
Ile Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln
180 185 190

ATG AAG TTT TGC CCA GAA TAT GTA AGC GTA TTT AAT AAT GTT CAA GAA 624
Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln Glu
195 200 205

AAC AAA GGC GCA AGT ATA TTT AAT AGA CGT GGA TAT TTT TCA GAT CCA 672
Asn Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro
210 215 220

GCC TTG ATA TTA ATG CAT GAA CTT ATA CAT GTT TTA CAT GGA TTA TAT 720
Ala Leu Ile Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr
225 230 235 240

GGC ATT AAA GTA GAT GAT TTA CCA ATT GTA CCA AAT GAA AAA AAA TTT 768
Gly Ile Lys Val Asp Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe
245 250 255

TTT ATG CAA TCT ACA GAT GCT ATA CAG GCA GAA GAA CTA TAT ACA TTT 816
Phe Met Gln Ser Thr Asp Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe
260 265 270

GGA GGA CAA GAT CCC AGC ATC ATA ACT CCT TCT ACG GAT AAA AGT ATC 864
Gly Gly Gln Asp Pro Ser Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile
275 280 285

TAT GAT AAA GTT TTG CAA AAT TTT AGA GGG ATA GTT GAT AGA CTT AAC 912
Tyr Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp Arg Leu Asn
290 295 300

AAG GTT TTA GTT TGC ATA TCA GAT CCT AAC ATT AAT ATT AAT ATA TAT 960
Lys Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
305 310 315 320

AAA AAT AAA TTT AAA GAT AAA TAT AAA TTC GTT GAA GAT TCT GAG GGA 1008
Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly
325 330 335

AAA TAT AGT ATA GAT GTA GAA AGT TTT GAT AAA TTA TAT AAA AGC TTA 1056
Lys Tyr Ser Ile Asp Val Glu Ser Phe Asp Lys Leu Tyr Lys Ser Leu
340 345 350

ATG TTT GGT TTT ACA GAA ACT AAT ATA GCA GAA AAT TAT AAA ATA AAA 1104
Met Phe Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile Lys
355 360 365

ACT AGA GCT TCT TAT TTT AGT GAT TCC TTA CCA CCA GTA AAA ATA AAA 1152
Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys
370 375 380

AAT TTA TTA GAT AAT GAA ATC TAT ACT ATA GAG GAA GGG TTT AAT ATA 1200
Asn Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile
385 390 395 400

TCT GAT AAA GAT ATG GAA AAA GAA TAT AGA GGT CAG AAT AAA GCT ATA 1248
Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile
405 410 415

AAT AAA CAA GCT TAT GAA GAA ATT AGC AAG GAG CAT TTG GCT GTA TAT 1296
Asn Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu His Leu Ala Val Tyr
420 425 430

AAG ATA CAA ATG TGT AAA AGT GTT AAA GCT CCA GGA ATA TGT ATT GAT 1344
Lys Ile Gln Met Cys Lys Ser Val Lys Ala Pro Gly Ile Cys Ile Asp
435 440 445

GTT GAT AAT GAA GAT TTG TTC TTT ATA GCT GAT AAA AAT AGT TTT TCA 1392
Val Asp Asn Glu Asp Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser
450 455 460

GAT GAT TTA TCT AAA AAC GAA AGA ATA GAA TAT AAT ACA CAG AGT AAT 1440
Asp Asp Leu Ser Lys Asn Glu Arg Ile Glu Tyr Asn Thr Gln Ser Asn
465 470 475 480

TAT ATA GAA AAT GAC TTC CCT ATA AAT GAA TTA ATT TTA GAT ACT GAT 1488
Tyr Ile Glu Asn Asp Phe Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp
485 490 495

TTA ATA AGT AAA ATA GAA TTA CCA AGT GAA AAT ACA GAA TCA CTT ACT 1536
Leu Ile Ser Lys Ile Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
500 505 510

GAT TTT AAT GTA GAT GTT CCA GTA TAT GAA AAA CAA CCC GCT ATA AAA 1584
Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
515 520 525

AAA ATT TTT ACA GAT GAA AAT ACC ATC TTT CAA TAT TTA TAC TCT CAG 1632
Lys Ile Phe Thr Asp Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln
530 535 540

ACA TTT CTC TTA GAT ATA AGA GAT ATA AGT TTA ACA TCT TCA TTT GAT 1680
Thr Phe Leu Leu Asp Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp
545 550 555 560

GAT GCA TTA TTA TTT TCT AAC AAA GTT TAT TCA TTT TTT TCT ATG GAT 1728
Asp Ala Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp
565 570 575

TAT ATT AAA ACT GCT AAT AAA GTG GTA GAA GCA GGA TTA TTT GCA GGT 1776
Tyr Ile Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly
580 585 590

TGG GTG AAA CAG ATA GTA AAT GAT TTT GTA ATC GAA GCT AAT AAA AGC 1824
Trp Val Lys Gln Ile Val Asn Asp Phe Val Ile Glu Ala Asn Lys Ser
595 600 605

AAT ACT ATG GAT AAA ATT GCA GAT ATA TCT CTA ATT GTT CCT TAT ATA 1872
Asn Thr Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
610 615 620

GGA TTA GCT TTA AAT GTA GGA AAT GAA ACA GCT AAA GGA AAT TTT GAA 1920
Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu
625 630 635 640

AAT GCT TTT GAG ATT GCA GGA GCC AGT ATT CTA CTA GAA TTT ATA CCA 1968
Asn Ala Phe Glu Ile Ala Gly Ala Ser Ile Leu Leu Glu Phe Ile Pro
645 650 655

GAA CTT TTA ATA CCT GTA GTT GGA GCC TTT TTA TTA GAA TCA TAT ATT 2016
Glu Leu Leu Ile Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile
660 665 670

GAC AAT AAA AAT AAA ATT ATT AAA ACA ATA GAT AAT GCT TTA ACT AAA 2064
Asp Asn Lys Asn Lys Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys
675 680 685

AGA AAT GAA AAA TGG AGT GAT ATG TAC GGA TTA ATA GTA GCG CAA TGG 2112
Arg Asn Glu Lys Trp Ser Asp Met Tyr Gly Leu Ile Val Ala Gln Trp
690 695 700

CTC TCA ACA GTT AAT ACT CAA TTT TAT ACA ATA AAA GAG GGA ATG TAT 2160
Leu Ser Thr Val Asn Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr
705 710 715 720

AAG GCT TTA AAT TAT CAA GCA CAA GCA TTG GAA GAA ATA ATA AAA TAC 2208
Lys Ala Leu Asn Tyr Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr
725 730 735

AGA TAT AAT ATA TAT TCT GAA AAA GAA AAG TCA AAT ATT AAC ATC GAT 2256
Arg Tyr Asn Ile Tyr Ser Glu Lys Glu Lys Ser Asn Ile Asn Ile Asp
740 745 750

TTT AAT GAT ATA AAT TCT AAA CTT AAT GAG GGT ATT AAC CAA GCT ATA 2304
Phe Asn Asp Ile Asn Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile
755 760 765

GAT AAT ATA AAT AAT TTT ATA AAT GGA TGT TCT GTA TCA TAT TTA ATG 2352
Asp Asn Ile Asn Asn Phe Ile Asn Gly Cys Ser Val Ser Tyr Leu Met
770 775 780

AAA AAA ATG ATT CCA TTA GCT GTA GAA AAA TTA CTA GAC TTT GAT AAT 2400
Lys Lys Met Ile Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn
785 790 795 800

ACT CTC AAA AAA AAT TTG TTA AAT TAT ATA GAT GAA AAT AAA TTA TAT 2448
Thr Leu Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr
805 810 815

TTG ATT GGA AGT GCA GAA TAT GAA AAA TCA AAA GTA AAT AAA TAC TTG 2496
Leu Ile Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu
820 825 830

AAA ACC ATT ATG CCG TTT GAT CTT TCA ATA TAT ACC AAT GAT ACA ATA 2544
Lys Thr Ile Met Pro Phe Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile
835 840 845

CTA ATA GAA ATG TTT AAT AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT 2592
Leu Ile Glu Met Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile
850 855 860

ATC TTA AAT TTA AGA TAT AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA 2640
Ile Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu Ile Asp Leu Ser Gly
865 870 875 880

TAT GGG GCA AAG GTA GAG GTA TAT GAT GGA GTC GAG CTT AAT GAT AAA 2688
Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lys
885 890 895

AAT CAA TTT AAA TTA ACT AGT TCA GCA AAT AGT AAG ATT AGA GTG ACT 2736
Asn Gln Phe Lys Leu Thr Ser Ser Ala Asn Ser Lys Ile Arg Val Thr
900 905 910

CAA AAT CAG AAT ATC ATA TTT AAT AGT GTG TTC CTT GAT TTT AGC GTT 2784
Gln Asn Gln Asn Ile Ile Phe Asn Ser Val Phe Leu Asp Phe Ser Val
915 920 925



AGC TTT TGG ATA AGA ATA CCT AAA TAT AAG AAT GAT GGT ATA CAA AAT 2832
Ser Phe Trp Ile Arg Ile Pro Lys Tyr Lys Asn Asp Gly Ile Gln Asn
930 935 940

TAT ATT CAT AAT GAA TAT ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG 2880
Tyr Ile His Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser
945 950 955 960

GGC TGG AAA ATA TCT ATT AGG GGT AAT AGG ATA ATA TGG ACT TTA ATT 2928
Gly Trp Lys Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile
965 970 975

GAT ATA AAT GGA AAA ACC AAA TCG GTA TTT TTT GAA TAT AAC ATA AGA 2976
Asp Ile Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg
980 985 990

GAA GAT ATA TCA GAG TAT ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT 3024
Glu Asp Ile Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr
995 1000 1005

AAT AAT TTG AAT AAC GCT AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA 3072
Asn Asn Leu Asn Asn Ala Lys Ile Tyr Ile Asn Gly Lys Leu Glu Ser
1010 1015 1020

AAT ACA GAT ATT AAA GAT ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA 3120
Asn Thr Asp Ile Lys Asp Ile Arg Glu Val Ile Ala Asn Gly Glu Ile
1025 1030 1035 1040

ATA TTT AAA TTA GAT GGT GAT ATA GAT AGA ACA CAA TTT ATT TGG ATG 3168
Ile Phe Lys Leu Asp Gly Asp Ile Asp Arg Thr Gln Phe Ile Trp Met
1045 1050 1055

AAA TAT TTC AGT ATT TTT AAT ACG GAA TTA AGT CAA TCA AAT ATT GAA 3216
Lys Tyr Phe Ser Ile Phe Asn Thr Glu Leu Ser Gln Ser Asn Ile Glu
1060 1065 1070

GAA AGA TAT AAA ATT CAA TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG 3264
Glu Arg Tyr Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp
1075 1080 1085

GGA AAT CCT TTA ATG TAC AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG 3312
Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly
1090 1095 1100

AAT AAA AAT TCA TAT ATT AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA 3360
Asn Lys Asn Ser Tyr Ile Lys Leu Lys Lys Asp Ser Pro Val Gly Glu
1105 1110 1115 1120

ATT TTA ACA CGT AGC AAA TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT 3408
Ile Leu Thr Arg Ser Lys Tyr Asn Gln Asn Ser Lys Tyr Ile Asn Tyr
1125 1130 1135

AGA GAT TTA TAT ATT GGA GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT 3456
Arg Asp Leu Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Lys Ser Asn
1140 1145 1150

TCT CAA TCT ATA AAT GAT GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT 3504
Ser Gln Ser Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile Tyr
1155 1160 1165

CTA GAT TTT TTT AAT TTA AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA 3552
Leu Asp Phe Phe Asn Leu Asn Gln Glu Trp Arg Val Tyr Thr Tyr Lys
1170 1175 1180

TAT TTT AAG AAA GAG GAA GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT 3600
Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro Ile Ser Asp
1185 1190 1195 1200



TCT GAT GAG TTT TAC AAT ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG 3648
Ser Asp Glu Phe Tyr Asn Thr Ile Gln Ile Lys Glu Tyr Asp Glu Gln
1205 1210 1215

CCA ACA TAT AGT TGT CAG TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT 3696
Pro Thr Tyr Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr
1220 1225 1230

GAT GAG ATA GGA TTG ATT GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT 3744
Asp Glu Ile Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Ile
1235 1240 1245

GTA TTT GAA GAG TAT AAA GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA 3792
Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu
1250 1255 1260

AAA GAG GTA AAA AGG AAA CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG 3840
Lys Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp
1265 1270 1275 1280

CAG TTT ATT CCT AAA GAT GAA GGG TGG ACT GAA TAA 3876
Gln Phe Ile Pro Lys Asp Glu Gly Trp Thr Glu
1285 1290

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Pro Val Thr Ile Asn Asn Phe Asn Tyr Asn Asp Pro Ile Asp Asn
1 5 10 15

Asn Asn Ile Ile Met Met Glu Pro Pro Phe Ala Arg Gly Thr Gly Arg
20 25 30

Tyr Tyr Lys Ala Phe Lys Ile Thr Asp Arg Ile Trp Ile Ile Pro Glu
35 40 45

Arg Tyr Thr Phe Gly Tyr Lys Pro Glu Asp Phe Asn Lys Ser Ser Gly
50 55 60

Ile Phe Asn Arg Asp Val Cys Glu Tyr Tyr Asp Pro Asp Tyr Leu Asn
65 70 75 80

Thr Asn Asp Lys Lys Asn Ile Phe Leu Gln Thr Met Ile Lys Leu Phe
85 90 95

Asn Arg Ile Lys Ser Lys Pro Leu Gly Glu Lys Leu Leu Glu Met Ile
100 105 110

Ile Asn Gly Ile Pro Tyr Leu Gly Asp Arg Arg Val Pro Leu Glu Glu
115 120 125

Phe Asn Thr Asn Ile Ala Ser Val Thr Val Asn Lys Leu Ile Ser Asn
130 135 140

Pro Gly Glu Val Glu Arg Lys Lys Gly Ile Phe Ala Asn Leu Ile Ile
145 150 155 160

Phe Gly Pro Gly Pro Val Leu Asn Glu Asn Glu Thr Ile Asp Ile Gly
165 170 175

Ile Gln Asn His Phe Ala Ser Arg Glu Gly Phe Gly Gly Ile Met Gln
180 185 190

Met Lys Phe Cys Pro Glu Tyr Val Ser Val Phe Asn Asn Val Gln Glu
195 200 205

Asn Lys Gly Ala Ser Ile Phe Asn Arg Arg Gly Tyr Phe Ser Asp Pro
210 215 220

Ala Leu Ile Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr
225 230 235 240

Gly Ile Lys Val Asp Asp Leu Pro Ile Val Pro Asn Glu Lys Lys Phe
245 250 255

Phe Met Gln Ser Thr Asp Ala Ile Gln Ala Glu Glu Leu Tyr Thr Phe
260 265 270

Gly Gly Gln Asp Pro Ser Ile Ile Thr Pro Ser Thr Asp Lys Ser Ile
275 280 285

Tyr Asp Lys Val Leu Gln Asn Phe Arg Gly Ile Val Asp Arg Leu Asn
290 295 300

Lys Val Leu Val Cys Ile Ser Asp Pro Asn Ile Asn Ile Asn Ile Tyr
305 310 315 320

Lys Asn Lys Phe Lys Asp Lys Tyr Lys Phe Val Glu Asp Ser Glu Gly
325 330 335

Lys Tyr Ser Ile Asp Val Glu Ser Phe Asp Lys Leu Tyr Lys Ser Leu
340 345 350

Met Phe Gly Phe Thr Glu Thr Asn Ile Ala Glu Asn Tyr Lys Ile Lys
355 360 365

Thr Arg Ala Ser Tyr Phe Ser Asp Ser Leu Pro Pro Val Lys Ile Lys
370 375 380

Asn Leu Leu Asp Asn Glu Ile Tyr Thr Ile Glu Glu Gly Phe Asn Ile
385 390 395 400

Ser Asp Lys Asp Met Glu Lys Glu Tyr Arg Gly Gln Asn Lys Ala Ile
405 410 415

Asn Lys Gln Ala Tyr Glu Glu Ile Ser Lys Glu His Leu Ala Val Tyr
420 425 430

Lys Ile Gln Met Cys Lys Ser Val Lys Ala Pro Gly Ile Cys Ile Asp
435 440 445

Val Asp Asn Glu Asp Leu Phe Phe Ile Ala Asp Lys Asn Ser Phe Ser
450 455 460

Asp Asp Leu Ser Lys Asn Glu Arg Ile Glu Tyr Asn Thr Gln Ser Asn
465 470 475 480

Tyr Ile Glu Asn Asp Phe Pro Ile Asn Glu Leu Ile Leu Asp Thr Asp
485 490 495

Leu Ile Ser Lys Ile Glu Leu Pro Ser Glu Asn Thr Glu Ser Leu Thr
500 505 510

Asp Phe Asn Val Asp Val Pro Val Tyr Glu Lys Gln Pro Ala Ile Lys
515 520 525

Lys Ile Phe Thr Asp Glu Asn Thr Ile Phe Gln Tyr Leu Tyr Ser Gln
530 535 540

Thr Phe Leu Leu Asp Ile Arg Asp Ile Ser Leu Thr Ser Ser Phe Asp
545 550 555 560

Asp Ala Leu Leu Phe Ser Asn Lys Val Tyr Ser Phe Phe Ser Met Asp
565 570 575

Tyr Ile Lys Thr Ala Asn Lys Val Val Glu Ala Gly Leu Phe Ala Gly
580 585 590

Trp Val Lys Gln Ile Val Asn Asp Phe Val Ile Glu Ala Asn Lys Ser
595 600 605

Asn Thr Met Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Ile
610 615 620

Gly Leu Ala Leu Asn Val Gly Asn Glu Thr Ala Lys Gly Asn Phe Glu
625 630 635 640

Asn Ala Phe Glu Ile Ala Gly Ala Ser Ile Leu Leu Glu Phe Ile Pro
645 650 655

Glu Leu Leu Ile Pro Val Val Gly Ala Phe Leu Leu Glu Ser Tyr Ile
660 665 670

Asp Asn Lys Asn Lys Ile Ile Lys Thr Ile Asp Asn Ala Leu Thr Lys
675 680 685

Arg Asn Glu Lys Trp Ser Asp Met Tyr Gly Leu Ile Val Ala Gln Trp
690 695 700

Leu Ser Thr Val Asn Thr Gln Phe Tyr Thr Ile Lys Glu Gly Met Tyr
705 710 715 720

Lys Ala Leu Asn Tyr Gln Ala Gln Ala Leu Glu Glu Ile Ile Lys Tyr
725 730 735

Arg Tyr Asn Ile Tyr Ser Glu Lys Glu Lys Ser Asn Ile Asn Ile Asp
740 745 750

Phe Asn Asp Ile Asn Ser Lys Leu Asn Glu Gly Ile Asn Gln Ala Ile
755 760 765

Asp Asn Ile Asn Asn Phe Ile Asn Gly Cys Ser Val Ser Tyr Leu Met
770 775 780

Lys Lys Met Ile Pro Leu Ala Val Glu Lys Leu Leu Asp Phe Asp Asn
785 790 795 800

Thr Leu Lys Lys Asn Leu Leu Asn Tyr Ile Asp Glu Asn Lys Leu Tyr
805 810 815

Leu Ile Gly Ser Ala Glu Tyr Glu Lys Ser Lys Val Asn Lys Tyr Leu
820 825 830

Lys Thr Ile Met Pro Phe Asp Leu Ser Ile Tyr Thr Asn Asp Thr Ile
835 840 845

Leu Ile Glu Met Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile
850 855 860

Ile Leu Asn Leu Arg Tyr Lys Asp Asn Asn Leu Ile Asp Leu Ser Gly
865 870 875 880

Tyr Gly Ala Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lys
885 890 895

Asn Gln Phe Lys Leu Thr Ser Ser Ala Asn Ser Lys Ile Arg Val Thr
900 905 910

Gln Asn Gln Asn Ile Ile Phe Asn Ser Val Phe Leu Asp Phe Ser Val
915 920 925

Ser Phe Trp Ile Arg Ile Pro Lys Tyr Lys Asn Asp Gly Ile Gln Asn
930 935 940

Tyr Ile His Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser
945 950 955 960

Gly Trp Lys Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile
965 970 975

Asp Ile Asn Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg
980 985 990

Glu Asp Ile Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr
995 1000 1005

Asn Asn Leu Asn Asn Ala Lys Ile Tyr Ile Asn Gly Lys Leu Glu Ser
1010 1015 1020

Asn Thr Asp Ile Lys Asp Ile Arg Glu Val Ile Ala Asn Gly Glu Ile
1025 1030 1035 1040

Ile Phe Lys Leu Asp Gly Asp Ile Asp Arg Thr Gln Phe Ile Trp Met
1045 1050 1055

Lys Tyr Phe Ser Ile Phe Asn Thr Glu Leu Ser Gln Ser Asn Ile Glu
1060 1065 1070

Glu Arg Tyr Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp
1075 1080 1085

Gly Asn Pro Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly
1090 1095 1100

Asn Lys Asn Ser Tyr Ile Lys Leu Lys Lys Asp Ser Pro Val Gly Glu
1105 1110 1115 1120

Ile Leu Thr Arg Ser Lys Tyr Asn Gln Asn Ser Lys Tyr Ile Asn Tyr
1125 1130 1135

Arg Asp Leu Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Lys Ser Asn
1140 1145 1150

Ser Gln Ser Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile Tyr
1155 1160 1165

Leu Asp Phe Phe Asn Leu Asn Gln Glu Trp Arg Val Tyr Thr Tyr Lys
1170 1175 1180

Tyr Phe Lys Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro Ile Ser Asp
1185 1190 1195 1200

Ser Asp Glu Phe Tyr Asn Thr Ile Gln Ile Lys Glu Tyr Asp Glu Gln
1205 1210 1215

Pro Thr Tyr Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr
1220 1225 1230

Asp Glu Ile Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Ile
1235 1240 1245

Val Phe Glu Glu Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu
1250 1255 1260

Lys Glu Val Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp
1265 1270 1275 1280

Gln Phe Ile Pro Lys Asp Glu Gly Trp Thr Glu
1285 1290

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1526 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 108..1523

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116
Met Gly His
1

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 212
Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Glu Met Phe Asn
20 25 30 35

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AGA TAT 260
Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile Ile Leu Asn Leu Arg Tyr
40 45 50

AGA GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGA GCA AAG GTA GAG 308
Arg Asp Asn Asn Leu Ile Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu
55 60 65

GTA TAT GAT GGG GTC AAG CTT AAT GAT AAA AAT CAA TTT AAA TTA ACT 356
Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gln Phe Lys Leu Thr
70 75 80

AGT TCA GCA GAT AGT AAG ATT AGA GTC ACT CAA AAT CAG AAT ATT ATA 404
Ser Ser Ala Asp Ser Lys Ile Arg Val Thr Gln Asn Gln Asn Ile Ile
85 90 95

TTT AAT AGT ATG TTC CTT GAT TTT AGC GTT AGC TTT TGG ATA AGG ATA 452
Phe Asn Ser Met Phe Leu Asp Phe Ser Val Ser Phe Trp Ile Arg Ile
100 105 110 115

CCT AAA TAT AGG AAT GAT GAT ATA CAA AAT TAT ATT CAT AAT GAA TAT 500
Pro Lys Tyr Arg Asn Asp Asp Ile Gln Asn Tyr Ile His Asn Glu Tyr
120 125 130

ACG ATA ATT AAT TGT ATG AAA AAT AAT TCA GGC TGG AAA ATA TCT ATT 548
Thr Ile Ile Asn Cys Met Lys Asn Asn Ser Gly Trp Lys Ile Ser Ile
135 140 145

AGG GGT AAT AGG ATA ATA TGG ACC TTA ATT GAT ATA AAT GGA AAA ACC 596
Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp Ile Asn Gly Lys Thr
150 155 160

AAA TCA GTA TTT TTT GAA TAT AAC ATA AGA GAA GAT ATA TCA GAG TAT 644
Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg Glu Asp Ile Ser Glu Tyr
165 170 175

ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG GAT AAT GCT 692
Ile Asn Arg Trp Phe Phe Val Thr Ile Thr Asn Asn Leu Asp Asn Ala
180 185 190 195

AAA ATT TAT ATT AAT GGC ACG TTA GAA TCA AAT ATG GAT ATT AAA GAT 740
Lys Ile Tyr Ile Asn Gly Thr Leu Glu Ser Asn Met Asp Ile Lys Asp
200 205 210

ATA GGA GAA GTT ATT GTT AAT GGT GAA ATA ACA TTT AAA TTA GAT GGT 788
Ile Gly Glu Val Ile Val Asn Gly Glu Ile Thr Phe Lys Leu Asp Gly
215 220 225

GAT GTA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTT AGT ATT TTT 836
Asp Val Asp Arg Thr Gln Phe Ile Trp Met Lys Tyr Phe Ser Ile Phe
230 235 240

AAT ACG CAA TTA AAT CAA TCA AAT ATT AAA GAG ATA TAT AAA ATT CAA 884
Asn Thr Gln Leu Asn Gln Ser Asn Ile Lys Glu Ile Tyr Lys Ile Gln
245 250 255

TCA TAT AGC GAA TAC TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAT 932
Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr
260 265 270 275

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 980
Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn Ser Tyr Ile
280 285 290

AAA CTA GTG AAA GAT TCA TCT GTA GGT GAA ATA TTA ATA CGT AGC AAA 1028
Lys Leu Val Lys Asp Ser Ser Val Gly Glu Ile Leu Ile Arg Ser Lys
295 300 305

TAT AAT CAG AAT TCC AAT TAT ATA AAT TAT AGA AAT TTA TAT ATT GGA 1076
Tyr Asn Gln Asn Ser Asn Tyr Ile Asn Tyr Arg Asn Leu Tyr Ile Gly
310 315 320

GAA AAA TTT ATT ATA AGA AGA GAG TCA AAT TCT CAA TCT ATA AAT GAT 1124
Glu Lys Phe Ile Ile Arg Arg Glu Ser Asn Ser Gln Ser Ile Asn Asp
325 330 335

GAT ATA GTT AGA AAA GAA GAT TAT ATA CAT CTA GAT TTG GTA CTT CAC 1172
Asp Ile Val Arg Lys Glu Asp Tyr Ile His Leu Asp Leu Val Leu His
340 345 350 355

CAT GAA GAG TGG AGA GTA TAT GCC TAT AAA TAT TTT AAG GAA CAG GAA 1220
His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys Glu Gln Glu
360 365 370

GAA AAA TTG TTT TTA TCT ATT ATA AGT GAT TCT AAT GAA TTT TAT AAG 1268
Glu Lys Leu Phe Leu Ser Ile Ile Ser Asp Ser Asn Glu Phe Tyr Lys
375 380 385

ACT ATA GAA ATA AAA GAA TAT GAT GAA CAG CCA TCA TAT AGT TGT CAG 1316
Thr Ile Glu Ile Lys Glu Tyr Asp Glu Gln Pro Ser Tyr Ser Cys Gln
390 395 400

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAT ATA GGA TTG ATT 1364
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp Ile Gly Leu Ile
405 410 415

GGT ATT CAT CGT TTC TAC GAA TCT GGA GTT TTA CGT AAA AAG TAT AAA 1412
Gly Ile His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys Lys Tyr Lys
420 425 430 435

GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 1460
Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu Lys Glu Val Lys Arg Lys
440 445 450

CCA TAT AAG TCA AAT TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 1508
Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gln Phe Ile Pro Lys Asp
455 460 465

GAA GGG TGG ACT GAA TAA 1526
Glu Gly Trp Thr Glu
470

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Glu
20 25 30

Met Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile Ile Leu Asn
35 40 45

Leu Arg Tyr Arg Asp Asn Asn Leu Ile Asp Leu Ser Gly Tyr Gly Ala
50 55 60

Lys Val Glu Val Tyr Asp Gly Val Lys Leu Asn Asp Lys Asn Gln Phe
65 70 75 80

Lys Leu Thr Ser Ser Ala Asp Ser Lys Ile Arg Val Thr Gln Asn Gln
85 90 95

Asn Ile Ile Phe Asn Ser Met Phe Leu Asp Phe Ser Val Ser Phe Trp
100 105 110

Ile Arg Ile Pro Lys Tyr Arg Asn Asp Asp Ile Gln Asn Tyr Ile His
115 120 125

Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser Gly Trp Lys
130 135 140

Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp Ile Asn
145 150 155 160

Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg Glu Asp Ile
165 170 175

Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr Asn Asn Leu
180 185 190

Asp Asn Ala Lys Ile Tyr Ile Asn Gly Thr Leu Glu Ser Asn Met Asp
195 200 205

Ile Lys Asp Ile Gly Glu Val Ile Val Asn Gly Glu Ile Thr Phe Lys
210 215 220

Leu Asp Gly Asp Val Asp Arg Thr Gln Phe Ile Trp Met Lys Tyr Phe
225 230 235 240

Ser Ile Phe Asn Thr Gln Leu Asn Gln Ser Asn Ile Lys Glu Ile Tyr
245 250 255

Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro
260 265 270

Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn
275 280 285



Ser Tyr Ile Lys Leu Val Lys Asp Ser Ser Val Gly Glu Ile Leu Ile
290 295 300

Arg Ser Lys Tyr Asn Gln Asn Ser Asn Tyr Ile Asn Tyr Arg Asn Leu
305 310 315 320

Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Glu Ser Asn Ser Gln Ser
325 330 335

Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile His Leu Asp Leu
340 345 350

Val Leu His His Glu Glu Trp Arg Val Tyr Ala Tyr Lys Tyr Phe Lys
355 360 365

Glu Gln Glu Glu Lys Leu Phe Leu Ser Ile Ile Ser Asp Ser Asn Glu
370 375 380

Phe Tyr Lys Thr Ile Glu Ile Lys Glu Tyr Asp Glu Gln Pro Ser Tyr
385 390 395 400

Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Asp Ile
405 410 415

Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Val Leu Arg Lys
420 425 430

Lys Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu Lys Glu Val
435 440 445

Lys Arg Lys Pro Tyr Lys Ser Asn Leu Gly Cys Asn Trp Gln Phe Ile
450 455 460

Pro Lys Asp Glu Gly Trp Thr Glu
465 470

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1547 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 108..1523

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT GAT ACA ATA CTA ATA GAA ATG TTT AAT 212
Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Glu Met Phe Asn
20 25 30 35

AAA TAT AAT AGC GAA ATT TTA AAT AAT ATT ATC TTA AAT TTA AGA TAT 260
Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile Ile Leu Asn Leu Arg Tyr
40 45 50



AAG GAT AAT AAT TTA ATA GAT TTA TCA GGA TAT GGG GCA AAG GTA GAG 308
Lys Asp Asn Asn Leu Ile Asp Leu Ser Gly Tyr Gly Ala Lys Val Glu
55 60 65

GTA TAT GAT GGA GTC GAG CTT AAT GAT AAA AAT CAA TTT AAA TTA ACT 356
Val Tyr Asp Gly Val Glu Leu Asn Asp Lys Asn Gln Phe Lys Leu Thr
70 75 80

AGT TCA GCA AAT AGT AAG ATT AGA GTG ACT CAA AAT CAG AAT ATC ATA 404
Ser Ser Ala Asn Ser Lys Ile Arg Val Thr Gln Asn Gln Asn Ile Ile
85 90 95

TTT AAT AGT GTG TTC CTT GAT TTT AGC GTT AGC TTT TGG ATA AGA ATA 452
Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp Ile Arg Ile
100 105 110 115

CCT AAA TAT AAG AAT GAT GGT ATA CAA AAT TAT ATT CAT AAT GAA TAT 500
Pro Lys Tyr Lys Asn Asp Gly Ile Gln Asn Tyr Ile His Asn Glu Tyr
120 125 130

ACA ATA ATT AAT TGT ATG AAA AAT AAT TCG GGC TGG AAA ATA TCT ATT 548
Thr Ile Ile Asn Cys Met Lys Asn Asn Ser Gly Trp Lys Ile Ser Ile
135 140 145

AGG GGT AAT AGG ATA ATA TGG ACT TTA ATT GAT ATA AAT GGA AAA ACC 596
Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp Ile Asn Gly Lys Thr
150 155 160

AAA TCG GTA TTT TTT GAA TAT AAC ATA AGA GAA GAT ATA TCA GAG TAT 644
Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg Glu Asp Ile Ser Glu Tyr
165 170 175

ATA AAT AGA TGG TTT TTT GTA ACT ATT ACT AAT AAT TTG AAT AAC GCT 692
Ile Asn Arg Trp Phe Phe Val Thr Ile Thr Asn Asn Leu Asn Asn Ala
180 185 190 195

AAA ATT TAT ATT AAT GGT AAG CTA GAA TCA AAT ACA GAT ATT AAA GAT 740
Lys Ile Tyr Ile Asn Gly Lys Leu Glu Ser Asn Thr Asp Ile Lys Asp
200 205 210

ATA AGA GAA GTT ATT GCT AAT GGT GAA ATA ATA TTT AAA TTA GAT GGT 788
Ile Arg Glu Val Ile Ala Asn Gly Glu Ile Ile Phe Lys Leu Asp Gly
215 220 225

GAT ATA GAT AGA ACA CAA TTT ATT TGG ATG AAA TAT TTC AGT ATT TTT 836
Asp Ile Asp Arg Thr Gln Phe Ile Trp Met Lys Tyr Phe Ser Ile Phe
230 235 240

AAT ACG GAA TTA AGT CAA TCA AAT ATT GAA GAA AGA TAT AAA ATT CAA 884
Asn Thr Glu Leu Ser Gln Ser Asn Ile Glu Glu Arg Tyr Lys Ile Gln
245 250 255

TCA TAT AGC GAA TAT TTA AAA GAT TTT TGG GGA AAT CCT TTA ATG TAC 932
Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro Leu Met Tyr
260 265 270 275

AAT AAA GAA TAT TAT ATG TTT AAT GCG GGG AAT AAA AAT TCA TAT ATT 980
Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn Ser Tyr Ile
280 285 290

AAA CTA AAG AAA GAT TCA CCT GTA GGT GAA ATT TTA ACA CGT AGC AAA 1028
Lys Leu Lys Lys Asp Ser Pro Val Gly Glu Ile Leu Thr Arg Ser Lys
295 300 305

TAT AAT CAA AAT TCT AAA TAT ATA AAT TAT AGA GAT TTA TAT ATT GGA 1076
Tyr Asn Gln Asn Ser Lys Tyr Ile Asn Tyr Arg Asp Leu Tyr Ile Gly
310 315 320

GAA AAA TTT ATT ATA AGA AGA AAG TCA AAT TCT CAA TCT ATA AAT GAT 1124
Glu Lys Phe Ile Ile Arg Arg Lys Ser Asn Ser Gln Ser Ile Asn Asp
325 330 335

GAT ATA GTT AGA AAA GAA GAT TAT ATA TAT CTA GAT TTT TTT AAT TTA 1172
Asp Ile Val Arg Lys Glu Asp Tyr Ile Tyr Leu Asp Phe Phe Asn Leu
340 345 350 355

AAT CAA GAG TGG AGA GTA TAT ACC TAT AAA TAT TTT AAG AAA GAG GAA 1220
Asn Gln Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe Lys Lys Glu Glu
360 365 370

GAA AAA TTG TTT TTA GCT CCT ATA AGT GAT TCT GAT GAG TTT TAC AAT 1268
Glu Lys Leu Phe Leu Ala Pro Ile Ser Asp Ser Asp Glu Phe Tyr Asn
375 380 385

ACT ATA CAA ATA AAA GAA TAT GAT GAA CAG CCA ACA TAT AGT TGT CAG 1316
Thr Ile Gln Ile Lys Glu Tyr Asp Glu Gln Pro Thr Tyr Ser Cys Gln
390 395 400

TTG CTT TTT AAA AAA GAT GAA GAA AGT ACT GAT GAG ATA GGA TTG ATT 1364
Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu Ile Gly Leu Ile
405 410 415

GGT ATT CAT CGT TTC TAC GAA TCT GGA ATT GTA TTT GAA GAG TAT AAA 1412
Gly Ile His Arg Phe Tyr Glu Ser Gly Ile Val Phe Glu Glu Tyr Lys
420 425 430 435

GAT TAT TTT TGT ATA AGT AAA TGG TAC TTA AAA GAG GTA AAA AGG AAA 1460
Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu Lys Glu Val Lys Arg Lys
440 445 450

CCA TAT AAT TTA AAA TTG GGA TGT AAT TGG CAG TTT ATT CCT AAA GAT 1508
Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gln Phe Ile Pro Lys Asp
455 460 465

GAA GGG TGG ACT GAA TAAAAGCTTG CGGCCGCACT CGAG 1547
Glu Gly Trp Thr Glu
470

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Glu
20 25 30

Met Phe Asn Lys Tyr Asn Ser Glu Ile Leu Asn Asn Ile Ile Leu Asn
35 40 45

Leu Arg Tyr Lys Asp Asn Asn Leu Ile Asp Leu Ser Gly Tyr Gly Ala
50 55 60

Lys Val Glu Val Tyr Asp Gly Val Glu Leu Asn Asp Lys Asn Gln Phe
65 70 75 80

Lys Leu Thr Ser Ser Ala Asn Ser Lys Ile Arg Val Thr Gln Asn Gln
85 90 95

Asn Ile Ile Phe Asn Ser Val Phe Leu Asp Phe Ser Val Ser Phe Trp
100 105 110

Ile Arg Ile Pro Lys Tyr Lys Asn Asp Gly Ile Gln Asn Tyr Ile His
115 120 125

Asn Glu Tyr Thr Ile Ile Asn Cys Met Lys Asn Asn Ser Gly Trp Lys
130 135 140

Ile Ser Ile Arg Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp Ile Asn
145 150 155 160

Gly Lys Thr Lys Ser Val Phe Phe Glu Tyr Asn Ile Arg Glu Asp Ile
165 170 175

Ser Glu Tyr Ile Asn Arg Trp Phe Phe Val Thr Ile Thr Asn Asn Leu
180 185 190

Asn Asn Ala Lys Ile Tyr Ile Asn Gly Lys Leu Glu Ser Asn Thr Asp
195 200 205

Ile Lys Asp Ile Arg Glu Val Ile Ala Asn Gly Glu Ile Ile Phe Lys
210 215 220

Leu Asp Gly Asp Ile Asp Arg Thr Gln Phe Ile Trp Met Lys Tyr Phe
225 230 235 240

Ser Ile Phe Asn Thr Glu Leu Ser Gln Ser Asn Ile Glu Glu Arg Tyr
245 250 255

Lys Ile Gln Ser Tyr Ser Glu Tyr Leu Lys Asp Phe Trp Gly Asn Pro
260 265 270

Leu Met Tyr Asn Lys Glu Tyr Tyr Met Phe Asn Ala Gly Asn Lys Asn
275 280 285

Ser Tyr Ile Lys Leu Lys Lys Asp Ser Pro Val Gly Glu Ile Leu Thr
290 295 300

Arg Ser Lys Tyr Asn Gln Asn Ser Lys Tyr Ile Asn Tyr Arg Asp Leu
305 310 315 320

Tyr Ile Gly Glu Lys Phe Ile Ile Arg Arg Lys Ser Asn Ser Gln Ser
325 330 335

Ile Asn Asp Asp Ile Val Arg Lys Glu Asp Tyr Ile Tyr Leu Asp Phe
340 345 350

Phe Asn Leu Asn Gln Glu Trp Arg Val Tyr Thr Tyr Lys Tyr Phe Lys
355 360 365

Lys Glu Glu Glu Lys Leu Phe Leu Ala Pro Ile Ser Asp Ser Asp Glu
370 375 380

Phe Tyr Asn Thr Ile Gln Ile Lys Glu Tyr Asp Glu Gln Pro Thr Tyr
385 390 395 400

Ser Cys Gln Leu Leu Phe Lys Lys Asp Glu Glu Ser Thr Asp Glu Ile
405 410 415

Gly Leu Ile Gly Ile His Arg Phe Tyr Glu Ser Gly Ile Val Phe Glu
420 425 430

Glu Tyr Lys Asp Tyr Phe Cys Ile Ser Lys Trp Tyr Leu Lys Glu Val
435 440 445

Lys Arg Lys Pro Tyr Asn Leu Lys Leu Gly Cys Asn Trp Gln Phe Ile
450 455 460

Pro Lys Asp Glu Gly Trp Thr Glu
465 470

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

CGCCATGGCT GATACAATAC TAATAGAAAT G 31 

(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

GCAAGCTTTT ATTCAGTCCA CCCTTCATC 29

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3753 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3750

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:



ATG CCA ACA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT AAT AGA 48
Met Pro Thr Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg
1 5 10 15

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA CAA TTT TAT AAA TCA 96
Thr Ile Leu Tyr Ile Lys Pro Gly Gly Cys Gln Gln Phe Tyr Lys Ser
20 25 30

TTT AAT ATT ATG AAA AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 144
Phe Asn Ile Met Lys Asn Ile Trp Ile Ile Pro Glu Arg Asn Val Ile
35 40 45

GGT ACA ATT CCC CAA GAT TTT CTT CCG CCT ACT TCA TTG AAA AAT GGA 192
Gly Thr Ile Pro Gln Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Gly
50 55 60

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT CAA GAA AAG 240
Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gln Ser Asp Gln Glu Lys
65 70 75 80

GAT AAA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA AAT GAT 288
Asp Lys Phe Leu Lys Ile Val Thr Lys Ile Phe Asn Arg Ile Asn Asp
85 90 95

AAT CTT TCA GGA AGG ATT TTA TTA GAA GAA CTG TCA AAA GCT AAT CCA 336
Asn Leu Ser Gly Arg Ile Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro
100 105 110

TAT TTA GGA AAT GAT AAT ACT CCA GAT GGT GAC TTC ATT ATT AAT GAT 384
Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe Ile Ile Asn Asp
115 120 125

GCA TCA GCA GTT CCA ATT CAA TTC TCA AAT GGT AGC CAA AGC ATA CTA 432
Ala Ser Ala Val Pro Ile Gln Phe Ser Asn Gly Ser Gln Ser Ile Leu
130 135 140

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 480
Leu Pro Asn Val Ile Ile Met Gly Ala Glu Pro Asp Leu Phe Glu Thr
145 150 155 160

AAC AGT TCC AAT ATT TCT CTA AGA AAT AAT TAT ATG CCA AGC AAT CAC 528
Asn Ser Ser Asn Ile Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His
165 170 175

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TCT TTT 576
Gly Phe Gly Ser Ile Ala Ile Val Thr Phe Ser Pro Glu Tyr Ser Phe
180 185 190

AGA TTT AAA GAT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 624
Arg Phe Lys Asp Asn Ser Met Asn Glu Phe Ile Gln Asp Pro Ala Leu
195 200 205

ACA TTA ATG CAT GAA TTA ATA CAT TCA TTA CAT GGA CTA TAT GGG GCT 672
Thr Leu Met His Glu Leu Ile His Ser Leu His Gly Leu Tyr Gly Ala
210 215 220

AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 720
Lys Gly Ile Thr Thr Lys Tyr Thr Ile Thr Gln Lys Gln Asn Pro Leu
225 230 235 240

ATA ACA AAT ATA AGA GGT ACA AAT ATT GAA GAA TTC TTA ACT TTT GGA 768
Ile Thr Asn Ile Arg Gly Thr Asn Ile Glu Glu Phe Leu Thr Phe Gly
245 250 255

GGT ACT GAT TTA AAC ATT ATT ACT AGT GCT CAG TCC AAT GAT ATC TAT 816
Gly Thr Asp Leu Asn Ile Ile Thr Ser Ala Gln Ser Asn Asp Ile Tyr
260 265 270

ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 864
Thr Asn Leu Leu Ala Asp Tyr Lys Lys Ile Ala Ser Lys Leu Ser Lys
275 280 285

GTA CAA GTA TCT AAT CCA CTA CTT AAT CCT TAT AAA GAT GTT TTT GAA 912
Val Gln Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu
290 295 300

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 960
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly Ile Tyr Ser Val Asn
305 310 315 320

ATA AAC AAA TTT AAT GAT ATT TTT AAA AAA TTA TAC AGC TTT ACG GAA 1008
Ile Asn Lys Phe Asn Asp Ile Phe Lys Lys Leu Tyr Ser Phe Thr Glu
325 330 335

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 1056
Phe Asp Leu Ala Thr Lys Phe Gln Val Lys Cys Arg Gln Thr Tyr Ile
340 345 350

GGA CAG TAT AAA TAC TTC AAA CTT TCA AAC TTG TTA AAT GAT TCT ATT 1104
Gly Gln Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser Ile
355 360 365

TAT AAT ATA TCA GAA GGC TAT AAT ATA AAT AAT TTA AAG GTA AAT TTT 1152
Tyr Asn Ile Ser Glu Gly Tyr Asn Ile Asn Asn Leu Lys Val Asn Phe
370 375 380

WO 98/08540

PCT/US97/15394

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 1200
Arg Gly Gln Asn Ala Asn Leu Asn Pro Arg Ile Ile Thr Pro Ile Thr
385 390 395 400

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 1248
Gly Arg Gly Leu Val Lys Lys Ile Ile Arg Phe Cys Lys Asn Ile Val
405 410 415

TCT GTA AAA GGC ATA AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 1296
Ser Val Lys Gly Ile Arg Lys Ser Ile Cys Ile Glu Ile Asn Asn Gly
420 425 430

GAG TTA TTT TTT GTG GCT TCC GAG AAT AGT TAT AAT GAT GAT AAT ATA 1344
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn Ile
435 440 445

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 1392
Asn Thr Pro Lys Glu Ile Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr
450 455 460

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 1440
Glu Asn Asp Leu Asp Gln Val Ile Leu Asn Phe Asn Ser Glu Ser Ala
465 470 475 480

CCT GGA CTT TCA GAT GAA AAA TTA AAT TTA ACT ATC CAA AAT GAT GCT 1488
Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr Ile Gln Asn Asp Ala
485 490 495

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 1536
Tyr Ile Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp Ile Glu Gln His
500 505 510

GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA GTG 1584
Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gln Lys Val
515 520 525

CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 1632
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser Ile Asp Thr Ala
530 535 540

TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 1680
Leu Leu Glu Gln Pro Lys Ile Tyr Thr Phe Phe Ser Ser Glu Phe Ile
545 550 555 560

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 1728
Asn Asn Val Asn Lys Pro Val Gln Ala Ala Leu Phe Val Ser Trp Ile
565 570 575

CAA CAA GTA TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA AGT ACT 1776
Gln Gln Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gln Lys Ser Thr
580 585 590

GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 1824
Val Asp Lys Ile Ala Asp Ile Ser Ile Val Val Pro Tyr Ile Gly Leu
595 600 605

GCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 1872
Ala Leu Asn Ile Gly Asn Glu Ala Gln Lys Gly Asn Phe Lys Asp Ala
610 615 620

CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 1920
Leu Glu Leu Leu Gly Ala Gly Ile Leu Leu Glu Phe Glu Pro Glu Leu
625 630 635 640

TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 1968
Leu Ile Pro Thr Ile Leu Val Phe Thr Ile Lys Ser Phe Leu Gly Ser
645 650 655

TCT GAT AAT AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 2016
Ser Asp Asn Lys Asn Lys Val Ile Lys Ala Ile Asn Asn Ala Leu Lys
660 665 670

GAA AGA GAT GAA AAA TGG AAA GAA GTA TAT AGT TTT ATA GTA TCG AAT 2064
Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe Ile Val Ser Asn
675 680 685

TGG ATG ACT AAA ATT AAT ACA CAA TTT AAT AAA AGA AAA GAA CAA ATG 2112
Trp Met Thr Lys Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu Gln Met
690 695 700

TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA CTT AAA GCA ATA ATA GAA 2160
Tyr Gln Ala Leu Gln Asn Gln Val Asn Ala Leu Lys Ala Ile Ile Glu
705 710 715 720

TCT AAG TAT AAT AGT TAT ACT TTA GAA GAA AAA AAT GAG CTT ACA AAT 2208
Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn
725 730 735

AAA TAT GAT ATT GAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 2256
Lys Tyr Asp Ile Glu Gln Ile Glu Asn Glu Leu Asn Gln Lys Val Ser
740 745 750

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCT 2304
Ile Ala Met Asn Asn Ile Asp Arg Phe Leu Thr Glu Ser Ser Ile Ser
755 760 765

TAT TTA ATG AAA TTA ATA AAT GAA GTA AAA ATT AAT AAA TTA AGA GAA 2352
Tyr Leu Met Lys Leu Ile Asn Glu Val Lys Ile Asn Lys Leu Arg Glu
770 775 780

TAT GAT GAA AAT GTT AAA ACG TAT TTA TTA GAT TAT ATT ATA AAA CAT 2400
Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr Ile Ile Lys His
785 790 795 800

GGA TCA ATC TTG GGA GAG AGT CAG CAA GAA CTA AAT TCT ATG GTA ATT 2448
Gly Ser Ile Leu Gly Glu Ser Gln Gln Glu Leu Asn Ser Met Val Ile
805 810 815

GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 2496
Asp Thr Leu Asn Asn Ser Ile Pro Phe Lys Leu Ser Ser Tyr Thr Asp
820 825 830

GAT AAA ATT TTA ATT TCA TAT TTT AAT AAG TTC TTT AAG AGA ATT AAA 2544
Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
835 840 845

AGT AGT TCT GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT 2592
Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
850 855 860

ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA 2640
Thr Ser Gly Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys
865 870 875 880

TAT CCA ACT AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 2688
Tyr Pro Thr Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser
885 890 895

GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 2736
Glu Val Asn Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr
900 905 910

AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 2784
Lys Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn
915 920 925

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGG 2832
Lys Ile Val Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg
930 935 940

GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 2880
Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile
945 950 955 960

TGG ACA TTG CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC 2928
Trp Thr Leu Gln Asp Asn Ser Gly Ile Asn Gln Lys Leu Ala Phe Asn
965 970 975

TAT GGT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 2976
Tyr Gly Asn Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe
980 985 990

GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT 3024
Val Thr Ile Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn
995 1000 1005

GGA AAT TTA ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 3072
Gly Asn Leu Ile Asp Lys Lys Ser Ile Leu Asn Leu Gly Asn Ile His
1010 1015 1020

GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 3120
Val Ser Asp Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg
1025 1030 1035 1040

TAT ATT GGT ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA 3168
Tyr Ile Gly Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu
1045 1050 1055

ACA GAA ATT CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA 3216
Thr Glu Ile Gln Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn Ile Leu
1060 1065 1070

AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 3264
Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu
1075 1080 1085

TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT 3312
Leu Asn Val Leu Lys Pro Asn Asn Phe Ile Asn Arg Arg Thr Asp Ser
1090 1095 1100

ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 3360
Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg
1105 1110 1115 1120

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 3408
Leu Tyr Ser Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser
1125 1130 1135

ACT AAC GAT AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT 3456
Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
1140 1145 1150

GTA GCC AGC AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC 3504
Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr
1155 1160 1165

ACA AAT AAA GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT 3552
Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
1170 1175 1180

AAT CAA GTA GTA GTT ATG AAT TCA GTA GGA TGT ACA ATG AAT TTT AAA 3600
Asn Gln Val Val Val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lys
1185 1190 1195 1200

AAT AAT AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT 3648
Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp Thr
1205 1210 1215

GTA GTT GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC 3696
Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn
1220 1225 1230

AGC AAT GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA 3744
Ser Asn Gly Phe Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp Gln
1235 1240 1245

GAA AAA TAA 3753
Glu Lys
1250

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1250 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Met Pro Thr Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asn Arg
1 5 10 15

Thr Ile Leu Tyr Ile Lys Pro Gly Gly Cys Gln Gln Phe Tyr Lys Ser
20 25 30

Phe Asn Ile Met Lys Asn Ile Trp Ile Ile Pro Glu Arg Asn Val Ile
35 40 45

Gly Thr Ile Pro Gln Asp Phe Leu Pro Pro Thr Ser Leu Lys Asn Gly
50 55 60

Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gln Ser Asp Gln Glu Lys
65 70 75 80

Asp Lys Phe Leu Lys Ile Val Thr Lys Ile Phe Asn Arg Ile Asn Asp
85 90 95

Asn Leu Ser Gly Arg Ile Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro
100 105 110

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Gly Asp Phe Ile Ile Asn Asp
115 120 125

Ala Ser Ala Val Pro Ile Gln Phe Ser Asn Gly Ser Gln Ser Ile Leu
130 135 140

Leu Pro Asn Val Ile Ile Met Gly Ala Glu Pro Asp Leu Phe Glu Thr
145 150 155 160

Asn Ser Ser Asn Ile Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His
165 170 175

Gly Phe Gly Ser Ile Ala Ile Val Thr Phe Ser Pro Glu Tyr Ser Phe
180 185 190

Arg Phe Lys Asp Asn Ser Met Asn Glu Phe Ile Gln Asp Pro Ala Leu
195 200 205

Thr Leu Met His Glu Leu Ile His Ser Leu His Gly Leu Tyr Gly Ala
210 215 220

Lys Gly Ile Thr Thr Lys Tyr Thr Ile Thr Gln Lys Gln Asn Pro Leu
225 230 235 240

Ile Thr Asn Ile Arg Gly Thr Asn Ile Glu Glu Phe Leu Thr Phe Gly
245 250 255

Gly Thr Asp Leu Asn Ile Ile Thr Ser Ala Gln Ser Asn Asp Ile Tyr
260 265 270

Thr Asn Leu Leu Ala Asp Tyr Lys Lys Ile Ala Ser Lys Leu Ser Lys
275 280 285

Val Gln Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu
290 295 300

Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly Ile Tyr Ser Val Asn
305 310 315 320

Ile Asn Lys Phe Asn Asp Ile Phe Lys Lys Leu Tyr Ser Phe Thr Glu
325 330 335

Phe Asp Leu Ala Thr Lys Phe Gln Val Lys Cys Arg Gln Thr Tyr Ile
340 345 350

Gly Gln Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser Ile
355 360 365

Tyr Asn Ile Ser Glu Gly Tyr Asn Ile Asn Asn Leu Lys Val Asn Phe
370 375 380

Arg Gly Gln Asn Ala Asn Leu Asn Pro Arg Ile Ile Thr Pro Ile Thr
385 390 395 400

Gly Arg Gly Leu Val Lys Lys Ile Ile Arg Phe Cys Lys Asn Ile Val
405 410 415

Ser Val Lys Gly Ile Arg Lys Ser Ile Cys Ile Glu Ile Asn Asn Gly
420 425 430

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn Ile
435 440 445

Asn Thr Pro Lys Glu Ile Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr
450 455 460

Glu Asn Asp Leu Asp Gln Val Ile Leu Asn Phe Asn Ser Glu Ser Ala
465 470 475 480

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr Ile Gln Asn Asp Ala
485 490 495

Tyr Ile Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp Ile Glu Gln His
500 505 510

Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gln Lys Val
515 520 525

Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser Ile Asp Thr Ala
530 535 540

Leu Leu Glu Gln Pro Lys Ile Tyr Thr Phe Phe Ser Ser Glu Phe Ile
545 550 555 560

Asn Asn Val Asn Lys Pro Val Gln Ala Ala Leu Phe Val Ser Trp Ile
565 570 575

Gln Gln Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gln Lys Ser Thr
580 585 590

Val Asp Lys Ile Ala Asp Ile Ser Ile Val Val Pro Tyr Ile Gly Leu
595 600 605

Ala Leu Asn Ile Gly Asn Glu Ala Gln Lys Gly Asn Phe Lys Asp Ala
610 615 620

Leu Glu Leu Leu Gly Ala Gly Ile Leu Leu Glu Phe Glu Pro Glu Leu
625 630 635 640

Leu Ile Pro Thr Ile Leu Val Phe Thr Ile Lys Ser Phe Leu Gly Ser
645 650 655

Ser Asp Asn Lys Asn Lys Val Ile Lys Ala Ile Asn Asn Ala Leu Lys
660 665 670

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe Ile Val Ser Asn
675 680 685

Trp Met Thr Lys Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu Gln Met
690 695 700

Tyr Gln Ala Leu Gln Asn Gln Val Asn Ala Leu Lys Ala Ile Ile Glu
705 710 715 720

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn
725 730 735

Lys Tyr Asp Ile Glu Gln Ile Glu Asn Glu Leu Asn Gln Lys Val Ser
740 745 750

Ile Ala Met Asn Asn Ile Asp Arg Phe Leu Thr Glu Ser Ser Ile Ser
755 760 765

Tyr Leu Met Lys Leu Ile Asn Glu Val Lys Ile Asn Lys Leu Arg Glu
770 775 780

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asp Tyr Ile Ile Lys His
785 790 795 800

Gly Ser Ile Leu Gly Glu Ser Gln Gln Glu Leu Asn Ser Met Val Ile
805 810 815

Asp Thr Leu Asn Asn Ser Ile Pro Phe Lys Leu Ser Ser Tyr Thr Asp
820 825 830

Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
835 840 845

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
850 855 860

Thr Ser Gly Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys
865 870 875 880

Tyr Pro Thr Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser
885 890 895

Glu Val Asn Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr
900 905 910

Lys Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn
915 920 925

Lys Ile Val Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg
930 935 940

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile
945 950 955 960

Trp Thr Leu Gln Asp Asn Ser Gly Ile Asn Gln Lys Leu Ala Phe Asn
965 970 975

Tyr Gly Asn Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe
980 985 990

Val Thr Ile Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn
995 1000 1005

Gly Asn Leu Ile Asp Lys Lys Ser Ile Leu Asn Leu Gly Asn Ile His
1010 1015 1020

Val Ser Asp Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg
1025 1030 1035 1040

Tyr Ile Gly Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu
1045 1050 1055

Thr Glu Ile Gln Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn Ile Leu
1060 1065 1070

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu
1075 1080 1085

Leu Asn Val Leu Lys Pro Asn Asn Phe Ile Asn Arg Arg Thr Asp Ser
1090 1095 1100

Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg
1105 1110 1115 1120

Leu Tyr Ser Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser
1125 1130 1135

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
1140 1145 1150

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr
1155 1160 1165

Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
1170 1175 1180

Asn Gln Val Val Val Met Asn Ser Val Gly Cys Thr Met Asn Phe Lys
1185 1190 1195 1200

Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp Thr
1205 1210 1215

Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn
1220 1225 1230

Ser Asn Gly Phe Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp Gln
1235 1240 1245

Glu Lys
1250

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3759 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3756

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:
ATG CCA AAA ATT AAT AGT TTT AAT TAT AAT GAT CCT GTT AAT GAT AGA 48
Met Pro Lys Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp Arg
1 5 10 15

ACA ATT TTA TAT ATT AAA CCA GGC GGT TGT CAA GAA TTT TAT AAA TCA 96
Thr Ile Leu Tyr Ile Lys Pro Gly Gly Cys Gln Glu Phe Tyr Lys Ser
20 25 30

TTT AAT ATT ATG AAA AAT ATT TGG ATA ATT CCA GAG AGA AAT GTA ATT 144
Phe Asn Ile Met Lys Asn Ile Trp Ile Ile Pro Glu Arg Asn Val Ile
35 40 45

GGT ACA ACC CCC CAA GAT TTT CAT CCG CCT ACT TCA TTA AAA AAT GGA 192
Gly Thr Thr Pro Gln Asp Phe His Pro Pro Thr Ser Leu Lys Asn Gly
50 55 60

GAT AGT AGT TAT TAT GAC CCT AAT TAT TTA CAA AGT GAT GAA GAA AAG 240
Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gln Ser Asp Glu Glu Lys
65 70 75 80

GAT AGA TTT TTA AAA ATA GTC ACA AAA ATA TTT AAT AGA ATA AAT AAT 288
Asp Arg Phe Leu Lys Ile Val Thr Lys Ile Phe Asn Arg Ile Asn Asn
85 90 95

AAT CTT TCA GGA GGG ATT TTA TTA GAA GAA CTG TCA AAA GCT AAT CCA 336
Asn Leu Ser Gly Gly Ile Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro
100 105 110

TAT TTA GGG AAT GAT AAT ACT CCA GAT AAT CAA TTC CAT ATT GGT GAT 384
Tyr Leu Gly Asn Asp Asn Thr Pro Asp Asn Gln Phe His Ile Gly Asp
115 120 125

GCA TCA GCA GTT GAG ATT AAA TTC TCA AAT GGT AGC CAA GAC ATA CTA 432
Ala Ser Ala Val Glu Ile Lys Phe Ser Asn Gly Ser Gln Asp Ile Leu
130 135 140

TTA CCT AAT GTT ATT ATA ATG GGA GCA GAG CCT GAT TTA TTT GAA ACT 480
Leu Pro Asn Val Ile Ile Met Gly Ala Glu Pro Asp Leu Phe Glu Thr
145 150 155 160

AAC AGT TCC AAT ATT TCT CTA AGA AAT AAT TAT ATG CCA AGC AAT CAC 528
Asn Ser Ser Asn Ile Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His
165 170 175

GGT TTT GGA TCA ATA GCT ATA GTA ACA TTC TCA CCT GAA TAT TCT TTT 576
Gly Phe Gly Ser Ile Ala Ile Val Thr Phe Ser Pro Glu Tyr Ser Phe
180 185 190

AGA TTT AAT GAT AAT AGT ATG AAT GAA TTT ATT CAA GAT CCT GCT CTT 624
Arg Phe Asn Asp Asn Ser Met Asn Glu Phe Ile Gln Asp Pro Ala Leu
195 200 205

ACA TTA ATG CAT GAA TTA ATA CAT TCA TTA CAT GGA CTA TAT GGG GCT 672
Thr Leu Met His Glu Leu Ile His Ser Leu His Gly Leu Tyr Gly Ala
210 215 220

AAA GGG ATT ACT ACA AAG TAT ACT ATA ACA CAA AAA CAA AAT CCC CTA 720
Lys Gly Ile Thr Thr Lys Tyr Thr Ile Thr Gln Lys Gln Asn Pro Leu
225 230 235 240

ATA ACA AAT ATA AGA GGT ACA AAT ATT GAA GAA TTC TTA ACT TTT GGA 768
Ile Thr Asn Ile Arg Gly Thr Asn Ile Glu Glu Phe Leu Thr Phe Gly
245 250 255

GGT ACT GAT TTA AAC ATT ATT ACT AGT GCT CAG TCC AAT GAT ATC TAT 816
Gly Thr Asp Leu Asn Ile Ile Thr Ser Ala Gln Ser Asn Asp Ile Tyr
260 265 270

ACT AAT CTT CTA GCT GAT TAT AAA AAA ATA GCG TCT AAA CTT AGC AAA 864
Thr Asn Leu Leu Ala Asp Tyr Lys Lys Ile Ala Ser Lys Leu Ser Lys
275 280 285

GTA CAA GTA TCT AAT CCA CTA CTT AAT CCT TAT AAA GAT GTT TTT GAA 912
Val Gln Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu
290 295 300

GCA AAG TAT GGA TTA GAT AAA GAT GCT AGC GGA ATT TAT TCG GTA AAT 960
Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly Ile Tyr Ser Val Asn
305 310 315 320

ATA AAC AAA TTT AAT GAT ATT TTT AAA AAA TTA TAC AGC TTT ACG GAA 1008
Ile Asn Lys Phe Asn Asp Ile Phe Lys Lys Leu Tyr Ser Phe Thr Glu
325 330 335

TTT GAT TTA GCA ACT AAA TTT CAA GTT AAA TGT AGG CAA ACT TAT ATT 1056
Phe Asp Leu Ala Thr Lys Phe Gln Val Lys Cys Arg Gln Thr Tyr Ile
340 345 350

GGA CAG TAT AAA TAC TTC AAA CTT TCA AAC TTG TTA AAT GAT TCT ATT 1104
Gly Gln Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser Ile
355 360 365

TAT AAT ATA TCA GAA GGC TAT AAT ATA AAT AAT TTA AAG GTA AAT TTT 1152
Tyr Asn Ile Ser Glu Gly Tyr Asn Ile Asn Asn Leu Lys Val Asn Phe
370 375 380

AGA GGA CAG AAT GCA AAT TTA AAT CCT AGA ATT ATT ACA CCA ATT ACA 1200
Arg Gly Gln Asn Ala Asn Leu Asn Pro Arg Ile Ile Thr Pro Ile Thr
385 390 395 400

GGT AGA GGA CTA GTA AAA AAA ATC ATT AGA TTT TGT AAA AAT ATT GTT 1248
Gly Arg Gly Leu Val Lys Lys Ile Ile Arg Phe Cys Lys Asn Ile Val
405 410 415

TCT GTA AAA GGC ATA AGG AAA TCA ATA TGT ATC GAA ATA AAT AAT GGT 1296
Ser Val Lys Gly Ile Arg Lys Ser Ile Cys Ile Glu Ile Asn Asn Gly
420 425 430

GAG TTA TTT TTT GTG GCT TCC GAG AAT AGT TAT AAT GAT GAT AAT ATA 1344
Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn Ile
435 440 445

AAT ACT CCT AAA GAA ATT GAC GAT ACA GTA ACT TCA AAT AAT AAT TAT 1392
Asn Thr Pro Lys Glu Ile Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr
450 455 460

GAA AAT GAT TTA GAT CAG GTT ATT TTA AAT TTT AAT AGT GAA TCA GCA 1440
Glu Asn Asp Leu Asp Gln Val Ile Leu Asn Phe Asn Ser Glu Ser Ala
465 470 475 480

CCT GGA CTT TCA GAT GAA AAA TTA AAT TTA ACT ATC CAA AAT GAT GCT 1488
Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr Ile Gln Asn Asp Ala
485 490 495

TAT ATA CCA AAA TAT GAT TCT AAT GGA ACA AGT GAT ATA GAA CAA CAT 1536
Tyr Ile Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp Ile Glu Gln His
500 505 510

GAT GTT AAT GAA CTT AAT GTA TTT TTC TAT TTA GAT GCA CAG AAA GTG 1584
Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gln Lys Val
515 520 525

CCC GAA GGT GAA AAT AAT GTC AAT CTC ACC TCT TCA ATT GAT ACA GCA 1632
Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser Ile Asp Thr Ala
530 535 540

TTA TTA GAA CAA CCT AAA ATA TAT ACA TTT TTT TCA TCA GAA TTT ATT 1680
Leu Leu Glu Gln Pro Lys Ile Tyr Thr Phe Phe Ser Ser Glu Phe Ile
545 550 555 560

AAT AAT GTC AAT AAA CCT GTG CAA GCA GCA TTA TTT GTA AGC TGG ATA 1728
Asn Asn Val Asn Lys Pro Val Gln Ala Ala Leu Phe Val Ser Trp Ile
565 570 575

CAA CAA GTG TTA GTA GAT TTT ACT ACT GAA GCT AAC CAA AAA AGT ACT 1776
Gln Gln Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gln Lys Ser Thr
580 585 590

GTT GAT AAA ATT GCA GAT ATT TCT ATA GTT GTT CCA TAT ATA GGT CTT 1824
Val Asp Lys Ile Ala Asp Ile Ser Ile Val Val Pro Tyr Ile Gly Leu
595 600 605

GCT TTA AAT ATA GGA AAT GAA GCA CAA AAA GGA AAT TTT AAA GAT GCA 1872
Ala Leu Asn Ile Gly Asn Glu Ala Gln Lys Gly Asn Phe Lys Asp Ala
610 615 620

CTT GAA TTA TTA GGA GCA GGT ATT TTA TTA GAA TTT GAA CCC GAG CTT 1920
Leu Glu Leu Leu Gly Ala Gly Ile Leu Leu Glu Phe Glu Pro Glu Leu
625 630 635 640

TTA ATT CCT ACA ATT TTA GTA TTC ACG ATA AAA TCT TTT TTA GGT TCA 1968
Leu Ile Pro Thr Ile Leu Val Phe Thr Ile Lys Ser Phe Leu Gly Ser
645 650 655

TCT GAT AAT AAA AAT AAA GTT ATT AAA GCA ATA AAT AAT GCA TTG AAA 2016
Ser Asp Asn Lys Asn Lys Val Ile Lys Ala Ile Asn Asn Ala Leu Lys
660 665 670

GAA AGA GAT GAA AAA TGG AAA GAA GTA TAT AGT TTT ATA GTA TCG AAT 2064
Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe Ile Val Ser Asn
675 680 685

TGG ATG ACT AAA ATT AAT ACA CAA TTT AAT AAA AGA AAA GAA CAA ATG 2112
Trp Met Thr Lys Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu Gln Met
690 695 700

TAT CAA GCT TTA CAA AAT CAA GTA AAT GCA ATT AAA ACA ATA ATA GAA 2160
Tyr Gln Ala Leu Gln Asn Gln Val Asn Ala Ile Lys Thr Ile Ile Glu
705 710 715 720

TCT AAG TAT AAT AGT TAT ACT TTA GAG GAA AAA AAT GAG CTT ACA AAT 2208
Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn
725 730 735

AAA TAT GAT ATT AAG CAA ATA GAA AAT GAA CTT AAT CAA AAG GTT TCT 2256
Lys Tyr Asp Ile Lys Gln Ile Glu Asn Glu Leu Asn Gln Lys Val Ser
740 745 750

ATA GCA ATG AAT AAT ATA GAC AGG TTC TTA ACT GAA AGT TCT ATA TCC 2304
Ile Ala Met Asn Asn Ile Asp Arg Phe Leu Thr Glu Ser Ser Ile Ser
755 760 765

TAT TTA ATG AAA TTA ATA AAT GAA GTA AAA ATT AAT AAA TTA AGA GAA 2352
Tyr Leu Met Lys Leu Ile Asn Glu Val Lys Ile Asn Lys Leu Arg Glu
770 775 780

TAT GAT GAG AAT GTC AAA ACG TAT TTA TTG AAT TAT ATT ATA CAA CAT 2400
Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr Ile Ile Gln His
785 790 795 800

GGA TCA ATC TTG GGA GAG AGT CAG CAA GAA CTA AAT TCT ATG GTA ACT 2448
Gly Ser Ile Leu Gly Glu Ser Gln Gln Glu Leu Asn Ser Met Val Thr
805 810 815

GAT ACC CTA AAT AAT AGT ATT CCT TTT AAG CTT TCT TCT TAT ACA GAT 2496
Asp Thr Leu Asn Asn Ser Ile Pro Phe Lys Leu Ser Ser Tyr Thr Asp
820 825 830

GAT AAA ATT TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA 2544
Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
835 840 845

AGT AGT TCA GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT 2592
Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
850 855 860

5 ACT TCA GGA TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA 2640 

Thr Ser Gly Tyr Asp Ser Asn He Asn He Asn Gly Asp Val Tyr Lys 
865 870 875 880 

TAT CCA ACT AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT 2688 

10 Tyr Pro Thr Asn Lys Asn Gin Phe Gly He Tyr Asn Asp Lys Leu Ser 

885 890 895 

f 

GAA GTT AAT ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT 2736 

Glu Val Asn He Ser Gin Asn Asp Tyr He He Tyr Asp Asn Lys Tyr 
l:» 900 905 910 

AAA AAT TTT AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT 2784 
Lys Asn Phe Ser He Ser Phe Trp Val Arg lie Pro Asn Tyr Asp Asn 
915 920 925 

20 

AAG ATA GTA AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA 2832 
Lys He Val Asn Val Asn Asn Glu Tyr Thr lie He Asn Cys Met Arg 
930 935 940 

25 GAT AAT AAT TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT 2880 

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu He He 
945 950 955 960 

TGG ACA TTG CAA GAT AAT GCA GGA ATT AAT CAA AAA TTA GCA TTT AAC 2 928 

30 Trp Thr Leu Gin Asp Asn Ala Gly He Asn Gin Lys Leu Ala Phe Asn 

965 970 975 

TAT GOT AAC GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT 2 976 

Tyr Gly Asn Ala Asn Gly He Ser Asp Tyr He Asn Lys Trp He Phe 
35 980 985 990 

GTA ACT ATA ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT 7VAT 3024 

Val Thr He Thr , Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr He Asn 

995 1000 1005 

40 

GGA AAT TTA ATA GAT CAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT 3 072 

Gly Asn Leu He Asp Gin Lys Ser He Leu Asn Leu Gly Asn He His 
1010 1015 1020 

45 GTT AGT GAC AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA 312 0 

Val Ser Asp Asn He Leu Phe Lys He Val Asn Cys Ser Tyr Thr Arg 
1025 1030 1035 1040 

TAT ATT GGT ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA 3168 
50 Tyr He Gly He Arg Tyr Phe Asn He Phe Asp Lys Glu Leu Asp Glu 

1045 1050 1055 

ACA GAA ATT CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG 3216 
Thr Glu He Gin Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn He Leu 
55 1060 1065 1070 

AAG GAT TTT TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA 3264 
Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu 
1075 1080 1085 



60 



TTA AAT GTG TTA AAA CCA AAT AAC TTT ATT GAT AGG AGA AAA GAT TCT 3 312 

Leu Asn Val Leu Lys Pro Asn Asn Phe He Asp Arg Arg Lys Asp Ser 
1090 1095 1100 



65 ACT TTA AGC ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA 3 36 0 

Thr Leu Ser He Asn Asn He Arg Ser Thr lie Leu Leu Ala Asn Arg 

1105 1110 1115 1120 

TTA TAT AGT GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT 3406 

70 Leu Tyr ser Gly He Lys Val Lys He Gin Arg Val Asn Asn Ser Ser 
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1125 1130 X135 

ACT AAC GAT AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT 3456
Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
1140 1145 1150

GTA GCC AGC AAA ACT CAC TTA TTT CCA TTA TAT GCT GAT ACA GCT ACC 3504
Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr
1155 1160 1165

ACA AAT AAA GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT 3552
Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
1170 1175 1180

AAT CAA GTA GTA GTT ATG AAT TCA GTA GGA AAT AAT TGT ACA ATG AAT 3600
Asn Gln Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn
1185 1190 1195 1200

TTT AAA AAT AAT AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA 3648
Phe Lys Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala
1205 1210 1215

GAT ACT GTA GTT GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT CAT 3696
Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His
1220 1225 1230

ACA AAC AGC AAT GGA TGT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA 3744
Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe Ile Ser Glu Glu His Gly
1235 1240 1245

TGG CAA GAA AAA TAA 3759
Trp Gln Glu Lys
1250
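
As a reading aid (not part of the original listing), the pairing of each nucleotide line with the amino acid line beneath it can be checked mechanically. The sketch below assumes the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative only. It translates the final line of the coding sequence above and confirms that 3759 bases correspond to 1252 codons plus one stop codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order ("*" marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna_line):
    """Translate space-separated codons into three-letter residues, stopping at a stop codon."""
    residues = []
    for codon in dna_line.split():
        aa = CODON_TABLE[codon.upper()]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


# Last line of the coding sequence shown above (bases 3745 to 3759).
print(" ".join(translate("TGG CAA GAA AAA TAA")))  # Trp Gln Glu Lys
# 1252 residues plus one stop codon account for all 3759 bases.
print(3759 == 3 * (1252 + 1))  # True
```

The same function can be applied to any codon line in the listing to spot residues that were misread during text extraction.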

(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1252 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Pro Lys Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp Arg
1 5 10 15

Thr Ile Leu Tyr Ile Lys Pro Gly Gly Cys Gln Glu Phe Tyr Lys Ser
20 25 30

Phe Asn Ile Met Lys Asn Ile Trp Ile Ile Pro Glu Arg Asn Val Ile
35 40 45

Gly Thr Thr Pro Gln Asp Phe His Pro Pro Thr Ser Leu Lys Asn Gly
50 55 60

Asp Ser Ser Tyr Tyr Asp Pro Asn Tyr Leu Gln Ser Asp Glu Glu Lys
65 70 75 80

Asp Arg Phe Leu Lys Ile Val Thr Lys Ile Phe Asn Arg Ile Asn Asn
85 90 95

Asn Leu Ser Gly Gly Ile Leu Leu Glu Glu Leu Ser Lys Ala Asn Pro
100 105 110

Tyr Leu Gly Asn Asp Asn Thr Pro Asp Asn Gln Phe His Ile Gly Asp
115 120 125

Ala Ser Ala Val Glu Ile Lys Phe Ser Asn Gly Ser Gln Asp Ile Leu
130 135 140



Leu Pro Asn Val Ile Ile Met Gly Ala Glu Pro Asp Leu Phe Glu Thr
145 150 155 160

Asn Ser Ser Asn Ile Ser Leu Arg Asn Asn Tyr Met Pro Ser Asn His
165 170 175

Gly Phe Gly Ser Ile Ala Ile Val Thr Phe Ser Pro Glu Tyr Ser Phe
180 185 190

Arg Phe Asn Asp Asn Ser Met Asn Glu Phe Ile Gln Asp Pro Ala Leu
195 200 205

Thr Leu Met His Glu Leu Ile His Ser Leu His Gly Leu Tyr Gly Ala
210 215 220

Lys Gly Ile Thr Thr Lys Tyr Thr Ile Thr Gln Lys Gln Asn Pro Leu
225 230 235 240

Ile Thr Asn Ile Arg Gly Thr Asn Ile Glu Glu Phe Leu Thr Phe Gly
245 250 255

Gly Thr Asp Leu Asn Ile Ile Thr Ser Ala Gln Ser Asn Asp Ile Tyr
260 265 270

Thr Asn Leu Leu Ala Asp Tyr Lys Lys Ile Ala Ser Lys Leu Ser Lys
275 280 285

Val Gln Val Ser Asn Pro Leu Leu Asn Pro Tyr Lys Asp Val Phe Glu
290 295 300

Ala Lys Tyr Gly Leu Asp Lys Asp Ala Ser Gly Ile Tyr Ser Val Asn
305 310 315 320

Ile Asn Lys Phe Asn Asp Ile Phe Lys Lys Leu Tyr Ser Phe Thr Glu
325 330 335

Phe Asp Leu Ala Thr Lys Phe Gln Val Lys Cys Arg Gln Thr Tyr Ile
340 345 350

Gly Gln Tyr Lys Tyr Phe Lys Leu Ser Asn Leu Leu Asn Asp Ser Ile
355 360 365

Tyr Asn Ile Ser Glu Gly Tyr Asn Ile Asn Asn Leu Lys Val Asn Phe
370 375 380

Arg Gly Gln Asn Ala Asn Leu Asn Pro Arg Ile Ile Thr Pro Ile Thr
385 390 395 400

Gly Arg Gly Leu Val Lys Lys Ile Ile Arg Phe Cys Lys Asn Ile Val
405 410 415

Ser Val Lys Gly Ile Arg Lys Ser Ile Cys Ile Glu Ile Asn Asn Gly
420 425 430

Glu Leu Phe Phe Val Ala Ser Glu Asn Ser Tyr Asn Asp Asp Asn Ile
435 440 445

Asn Thr Pro Lys Glu Ile Asp Asp Thr Val Thr Ser Asn Asn Asn Tyr
450 455 460

Glu Asn Asp Leu Asp Gln Val Ile Leu Asn Phe Asn Ser Glu Ser Ala
465 470 475 480

Pro Gly Leu Ser Asp Glu Lys Leu Asn Leu Thr Ile Gln Asn Asp Ala
485 490 495

Tyr Ile Pro Lys Tyr Asp Ser Asn Gly Thr Ser Asp Ile Glu Gln His
500 505 510

Asp Val Asn Glu Leu Asn Val Phe Phe Tyr Leu Asp Ala Gln Lys Val
515 520 525

Pro Glu Gly Glu Asn Asn Val Asn Leu Thr Ser Ser Ile Asp Thr Ala
530 535 540

Leu Leu Glu Gln Pro Lys Ile Tyr Thr Phe Phe Ser Ser Glu Phe Ile
545 550 555 560

Asn Asn Val Asn Lys Pro Val Gln Ala Ala Leu Phe Val Ser Trp Ile
565 570 575

Gln Gln Val Leu Val Asp Phe Thr Thr Glu Ala Asn Gln Lys Ser Thr
580 585 590

Val Asp Lys Ile Ala Asp Ile Ser Ile Val Val Pro Tyr Ile Gly Leu
595 600 605

Ala Leu Asn Ile Gly Asn Glu Ala Gln Lys Gly Asn Phe Lys Asp Ala
610 615 620

Leu Glu Leu Leu Gly Ala Gly Ile Leu Leu Glu Phe Glu Pro Glu Leu
625 630 635 640

Leu Ile Pro Thr Ile Leu Val Phe Thr Ile Lys Ser Phe Leu Gly Ser
645 650 655

Ser Asp Asn Lys Asn Lys Val Ile Lys Ala Ile Asn Asn Ala Leu Lys
660 665 670

Glu Arg Asp Glu Lys Trp Lys Glu Val Tyr Ser Phe Ile Val Ser Asn
675 680 685

Trp Met Thr Lys Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu Gln Met
690 695 700

Tyr Gln Ala Leu Gln Asn Gln Val Asn Ala Ile Lys Thr Ile Ile Glu
705 710 715 720

Ser Lys Tyr Asn Ser Tyr Thr Leu Glu Glu Lys Asn Glu Leu Thr Asn
725 730 735

Lys Tyr Asp Ile Lys Gln Ile Glu Asn Glu Leu Asn Gln Lys Val Ser
740 745 750

Ile Ala Met Asn Asn Ile Asp Arg Phe Leu Thr Glu Ser Ser Ile Ser
755 760 765

Tyr Leu Met Lys Leu Ile Asn Glu Val Lys Ile Asn Lys Leu Arg Glu
770 775 780

Tyr Asp Glu Asn Val Lys Thr Tyr Leu Leu Asn Tyr Ile Ile Gln His
785 790 795 800

Gly Ser Ile Leu Gly Glu Ser Gln Gln Glu Leu Asn Ser Met Val Thr
805 810 815

Asp Thr Leu Asn Asn Ser Ile Pro Phe Lys Leu Ser Ser Tyr Thr Asp
820 825 830

Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
835 840 845

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
850 855 860

Thr Ser Gly Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys
865 870 875 880

Tyr Pro Thr Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser
885 890 895

Glu Val Asn Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr
900 905 910

Lys Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn
915 920 925

Lys Ile Val Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg
930 935 940

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile
945 950 955 960

Trp Thr Leu Gln Asp Asn Ala Gly Ile Asn Gln Lys Leu Ala Phe Asn
965 970 975

Tyr Gly Asn Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe
980 985 990

Val Thr Ile Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn
995 1000 1005

Gly Asn Leu Ile Asp Gln Lys Ser Ile Leu Asn Leu Gly Asn Ile His
1010 1015 1020

Val Ser Asp Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg
1025 1030 1035 1040

Tyr Ile Gly Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu
1045 1050 1055

Thr Glu Ile Gln Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn Ile Leu
1060 1065 1070

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu
1075 1080 1085

Leu Asn Val Leu Lys Pro Asn Asn Phe Ile Asp Arg Arg Lys Asp Ser
1090 1095 1100

Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg
1105 1110 1115 1120

Leu Tyr Ser Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser
1125 1130 1135

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
1140 1145 1150

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr
1155 1160 1165

Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
1170 1175 1180

Asn Gln Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn
1185 1190 1195 1200

Phe Lys Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala
1205 1210 1215

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His
1220 1225 1230

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe Ile Ser Glu Glu His Gly
1235 1240 1245

Trp Gln Glu Lys
1250

(2) INFORMATION FOR SEQ ID NO: 53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1463 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 108..1460

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116
Met Gly His
1

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 212
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys Ile
20 25 30 35

TTA ATT TCA TAT TTT AAT AAG TTC TTT AAG AGA ATT AAA AGT AGT TCT 260
Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys Ser Ser Ser
40 45 50

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 308
Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp Thr Ser Gly
55 60 65

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 356
Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys Tyr Pro Thr
70 75 80

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 404
Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser Glu Val Asn
85 90 95

ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT AAA AAT TTT 452
Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr Lys Asn Phe
100 105 110 115

AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT AAG ATA GTA 500
Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn Lys Ile Val
120 125 130

AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGG GAT AAT AAT 548
Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg Asp Asn Asn
135 140 145

TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT TGG ACA TTG 596
Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile Trp Thr Leu
150 155 160

CAA GAT AAT TCA GGA ATT AAT CAA AAA TTA GCA TTT AAC TAT GGT AAC 644
Gln Asp Asn Ser Gly Ile Asn Gln Lys Leu Ala Phe Asn Tyr Gly Asn
165 170 175

GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT GTA ACT ATA 692
Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe Val Thr Ile
180 185 190 195






ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 740
Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn Gly Asn Leu
200 205 210

ATA GAT AAA AAA TCA ATT TTA AAT TTA GGT AAT ATT CAT GTT AGT GAC 788
Ile Asp Lys Lys Ser Ile Leu Asn Leu Gly Asn Ile His Val Ser Asp
215 220 225

AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 836
Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg Tyr Ile Gly
230 235 240

ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 884
Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu Thr Glu Ile
245 250 255

CAA ACT TTA TAT AAC AAT GAA CCT AAT GCA AAT ATT TTA AAG GAT TTT 932
Gln Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn Ile Leu Lys Asp Phe
260 265 270 275

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 980
Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val
280 285 290

TTA AAA CCA AAT AAC TTT ATT AAT AGG AGA ACA GAT TCT ACT TTA AGC 1028
Leu Lys Pro Asn Asn Phe Ile Asn Arg Arg Thr Asp Ser Thr Leu Ser
295 300 305

ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA TTA TAT AGT 1076
Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg Leu Tyr Ser
310 315 320

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 1124
Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser Thr Asn Asp
325 330 335

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 1172
Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe Val Ala Ser
340 345 350 355

AAA ACT CAC TTA CTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 1220
Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lys
360 365 370

GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 1268
Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe Asn Gln Val
375 380 385

GTA GTT ATG AAT TCA GTA GGA AAT TGT ACA ATG AAT TTT AAA AAT AAT 1316
Val Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe Lys Asn Asn
390 395 400

AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT GTA GTT 1364
Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp Thr Val Val
405 410 415

GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT AAT ACA AAC AGC AAT 1412
Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr Asn Ser Asn
420 425 430 435

GGA TTT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA GAA AAA 1460
Gly Phe Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp Gln Glu Lys
440 445 450

TAA 1463

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 451 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp
20 25 30

Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
35 40 45

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
50 55 60

Thr Ser Gly Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys
65 70 75 80

Tyr Pro Thr Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser
85 90 95

Glu Val Asn Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr
100 105 110

Lys Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn
115 120 125

Lys Ile Val Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg
130 135 140

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile
145 150 155 160

Trp Thr Leu Gln Asp Asn Ser Gly Ile Asn Gln Lys Leu Ala Phe Asn
165 170 175

Tyr Gly Asn Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe
180 185 190

Val Thr Ile Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn
195 200 205

Gly Asn Leu Ile Asp Lys Lys Ser Ile Leu Asn Leu Gly Asn Ile His
210 215 220

Val Ser Asp Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg
225 230 235 240

Tyr Ile Gly Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu
245 250 255

Thr Glu Ile Gln Thr Leu Tyr Asn Asn Glu Pro Asn Ala Asn Ile Leu
260 265 270

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu
275 280 285

Leu Asn Val Leu Lys Pro Asn Asn Phe Ile Asn Arg Arg Thr Asp Ser
290 295 300

Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg
305 310 315 320

Leu Tyr Ser Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser
325 330 335

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
340 345 350

Val Ala Ser Lys Thr His Leu Leu Pro Leu Tyr Ala Asp Thr Ala Thr
355 360 365

Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
370 375 380

Asn Gln Val Val Val Met Asn Ser Val Gly Asn Cys Thr Met Asn Phe
385 390 395 400

Lys Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp
405 410 415

Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp Asn Thr
420 425 430

Asn Ser Asn Gly Phe Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp
435 440 445

Gln Glu Lys
450
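
The CDS location given for SEQ ID NO: 53 and the protein length given for SEQ ID NO: 54 can be cross-checked with simple arithmetic. The sketch below is a reading aid only, under the assumption that a location written `start..end` is inclusive at both ends; `cds_codons` is a hypothetical helper name.

```python
def cds_codons(start, end):
    """Return the number of codons in a CDS whose location is start..end (inclusive)."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# SEQ ID NO: 53 lists CDS 108..1460 within a 1463 base pair sequence.
print(cds_codons(108, 1460))  # 451
# Bases remaining after the CDS (the trailing stop codon).
print(1463 - 1460)  # 3
```

For SEQ ID NO: 53 the CDS spans 1353 bases, that is 451 codons, which agrees with the 451 residues listed for SEQ ID NO: 54; the remaining three bases (1461 to 1463) are the TAA stop codon.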

(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1472 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 108..1463

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116
Met Gly His
1

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT CTT TCT TCT TAT ACA GAT GAT AAA ATT 212
Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp Asp Lys Ile
20 25 30 35

TTA ATT TCA TAT TTT AAT AAA TTC TTT AAG AGA ATT AAA AGT AGT TCA 260
Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys Ser Ser Ser
40 45 50

GTT TTA AAT ATG AGA TAT AAA AAT GAT AAA TAC GTA GAT ACT TCA GGA 308
Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp Thr Ser Gly
55 60 65

TAT GAT TCA AAT ATA AAT ATT AAT GGA GAT GTA TAT AAA TAT CCA ACT 356
Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys Tyr Pro Thr
70 75 80

AAT AAA AAT CAA TTT GGA ATA TAT AAT GAT AAA CTT AGT GAA GTT AAT 404
Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser Glu Val Asn
85 90 95

ATA TCT CAA AAT GAT TAC ATT ATA TAT GAT AAT AAA TAT AAA AAT TTT 452
Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr Lys Asn Phe
100 105 110 115

AGT ATT AGT TTT TGG GTA AGA ATT CCT AAC TAT GAT AAT AAG ATA GTA 500
Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn Lys Ile Val
120 125 130

AAT GTT AAT AAT GAA TAC ACT ATA ATA AAT TGT ATG AGA GAT AAT AAT 548
Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg Asp Asn Asn
135 140 145

TCA GGA TGG AAA GTA TCT CTT AAT CAT AAT GAA ATA ATT TGG ACA TTG 596
Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile Trp Thr Leu
150 155 160

CAA GAT AAT GCA GGA ATT AAT CAA AAA TTA GCA TTT AAC TAT GGT AAC 644
Gln Asp Asn Ala Gly Ile Asn Gln Lys Leu Ala Phe Asn Tyr Gly Asn
165 170 175

GCA AAT GGT ATT TCT GAT TAT ATA AAT AAG TGG ATT TTT GTA ACT ATA 692
Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe Val Thr Ile
180 185 190 195

ACT AAT GAT AGA TTA GGA GAT TCT AAA CTT TAT ATT AAT GGA AAT TTA 740
Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn Gly Asn Leu
200 205 210

[10 codons illegible] AAT ATT CAT GTT AGT GAC 788
Ile Asp Gln Lys Ser Ile Leu Asn Leu Gly Asn Ile His Val Ser Asp
215 220 225

AAT ATA TTA TTT AAA ATA GTT AAT TGT AGT TAT ACA AGA TAT ATT GGT 836
Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg Tyr Ile Gly
230 235 240

ATT AGA TAT TTT AAT ATT TTT GAT AAA GAA TTA GAT GAA ACA GAA ATT 884
Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu Thr Glu Ile
245 250 255

CAA ACT TTA TAT AGC AAT GAA CCT AAT ACA AAT ATT TTG AAG GAT TTT 932
Gln Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn Ile Leu Lys Asp Phe
260 265 270 275

TGG GGA AAT TAT TTG CTT TAT GAC AAA GAA TAC TAT TTA TTA AAT GTG 980
Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu Leu Asn Val
280 285 290

TTA AAA CCA AAT AAC TTT ATT GAT AGG AGA AAA GAT TCT ACT TTA AGC 1028
Leu Lys Pro Asn Asn Phe Ile Asp Arg Arg Lys Asp Ser Thr Leu Ser
295 300 305

ATT AAT AAT ATA AGA AGC ACT ATT CTT TTA GCT AAT AGA TTA TAT AGT 1076
Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg Leu Tyr Ser
310 315 320

GGA ATA AAA GTT AAA ATA CAA AGA GTT AAT AAT AGT AGT ACT AAC GAT 1124
Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser Thr Asn Asp
325 330 335

AAT CTT GTT AGA AAG AAT GAT CAG GTA TAT ATT AAT TTT GTA GCC AGC 1172
Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe Val Ala Ser
340 345 350 355

AAA ACT CAC TTA TTT CCA TTA TAT GCT GAT ACA GCT ACC ACA AAT AAA 1220
Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr Thr Asn Lys
360 365 370

GAG AAA ACA ATA AAA ATA TCA TCA TCT GGC AAT AGA TTT AAT CAA GTA 1268
Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe Asn Gln Val
375 380 385

GTA GTT ATG AAT TCA GTA GGA AAT AAT TGT ACA ATG AAT TTT AAA AAT 1316
Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn Phe Lys Asn
390 395 400

AAT AAT GGA AAT AAT ATT GGG TTG TTA GGT TTC AAG GCA GAT ACT GTA 1364
Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala Asp Thr Val
405 410 415

GTT GCT AGT ACT TGG TAT TAT ACA CAT ATG AGA GAT CAT ACA AAC AGC 1412
Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His Thr Asn Ser
420 425 430 435

AAT GGA TGT TTT TGG AAC TTT ATT TCT GAA GAA CAT GGA TGG CAA GAA 1460
Asn Gly Cys Phe Trp Asn Phe Ile Ser Glu Glu His Gly Trp Gln Glu
440 445 450

AAA TAAAAGCTT 1472
Lys

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 452 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Leu Ser Ser Tyr Thr Asp
20 25 30

Asp Lys Ile Leu Ile Ser Tyr Phe Asn Lys Phe Phe Lys Arg Ile Lys
35 40 45

Ser Ser Ser Val Leu Asn Met Arg Tyr Lys Asn Asp Lys Tyr Val Asp
50 55 60

Thr Ser Gly Tyr Asp Ser Asn Ile Asn Ile Asn Gly Asp Val Tyr Lys
65 70 75 80

Tyr Pro Thr Asn Lys Asn Gln Phe Gly Ile Tyr Asn Asp Lys Leu Ser
85 90 95

Glu Val Asn Ile Ser Gln Asn Asp Tyr Ile Ile Tyr Asp Asn Lys Tyr
100 105 110

Lys Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Asn Tyr Asp Asn
115 120 125

Lys Ile Val Asn Val Asn Asn Glu Tyr Thr Ile Ile Asn Cys Met Arg
130 135 140

Asp Asn Asn Ser Gly Trp Lys Val Ser Leu Asn His Asn Glu Ile Ile
145 150 155 160

Trp Thr Leu Gln Asp Asn Ala Gly Ile Asn Gln Lys Leu Ala Phe Asn
165 170 175

Tyr Gly Asn Ala Asn Gly Ile Ser Asp Tyr Ile Asn Lys Trp Ile Phe
180 185 190

Val Thr Ile Thr Asn Asp Arg Leu Gly Asp Ser Lys Leu Tyr Ile Asn
195 200 205

Gly Asn Leu Ile Asp Gln Lys Ser Ile Leu Asn Leu Gly Asn Ile His
210 215 220

Val Ser Asp Asn Ile Leu Phe Lys Ile Val Asn Cys Ser Tyr Thr Arg
225 230 235 240

Tyr Ile Gly Ile Arg Tyr Phe Asn Ile Phe Asp Lys Glu Leu Asp Glu
245 250 255

Thr Glu Ile Gln Thr Leu Tyr Ser Asn Glu Pro Asn Thr Asn Ile Leu
260 265 270

Lys Asp Phe Trp Gly Asn Tyr Leu Leu Tyr Asp Lys Glu Tyr Tyr Leu
275 280 285

Leu Asn Val Leu Lys Pro Asn Asn Phe Ile Asp Arg Arg Lys Asp Ser
290 295 300

Thr Leu Ser Ile Asn Asn Ile Arg Ser Thr Ile Leu Leu Ala Asn Arg
305 310 315 320

Leu Tyr Ser Gly Ile Lys Val Lys Ile Gln Arg Val Asn Asn Ser Ser
325 330 335

Thr Asn Asp Asn Leu Val Arg Lys Asn Asp Gln Val Tyr Ile Asn Phe
340 345 350

Val Ala Ser Lys Thr His Leu Phe Pro Leu Tyr Ala Asp Thr Ala Thr
355 360 365

Thr Asn Lys Glu Lys Thr Ile Lys Ile Ser Ser Ser Gly Asn Arg Phe
370 375 380

Asn Gln Val Val Val Met Asn Ser Val Gly Asn Asn Cys Thr Met Asn
385 390 395 400

Phe Lys Asn Asn Asn Gly Asn Asn Ile Gly Leu Leu Gly Phe Lys Ala
405 410 415

Asp Thr Val Val Ala Ser Thr Trp Tyr Tyr Thr His Met Arg Asp His
420 425 430

Thr Asn Ser Asn Gly Cys Phe Trp Asn Phe Ile Ser Glu Glu His Gly
435 440 445

Trp Gln Glu Lys
450

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

CGCCATGGCT CTTTCTTCTT ATACAGATGA T 31

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

GCAAGCTTTT ATTTTTCTTG CCATCCATG 29
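
The two short sequences above can be compared with the longer listings by plain string operations. The sketch below is a reading aid, not part of the original listing. It checks the stated lengths, looks for the NcoI (CCATGG) and HindIII (AAGCTT) recognition sequences, and takes the reverse complement of SEQ ID NO: 58 so it can be read against the 3' end of SEQ ID NO: 55. This section of the listing does not state how the oligonucleotides were used, so reading these sites as cloning sites is an assumption; `reverse_complement` is an illustrative helper name.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences copied from the listing; the spaces are only block separators.
SEQ57 = "".join("CGCCATGGCT CTTTCTTCTT ATACAGATGA T".split())
SEQ58 = "".join("GCAAGCTTTT ATTTTTCTTG CCATCCATG".split())

NCOI_SITE = "CCATGG"     # NcoI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence

print(len(SEQ57), len(SEQ58))      # 31 29
print(NCOI_SITE in SEQ57)          # True
print(HINDIII_SITE in SEQ58)       # True
print(reverse_complement(SEQ58))   # CATGGATGGCAAGAAAAATAAAAGCTTGC
```

Read as codons, the reverse complement of SEQ ID NO: 58 begins CAT GGA TGG CAA GAA AAA TAA followed by AAGCTT, which is the same string that closes SEQ ID NO: 55 (His Gly Trp Gln Glu Lys, the stop codon, then AAGCTT). SEQ ID NO: 57 from its fifth base onward reads ATG GCT CTT TCT TCT TAT ACA GAT GAT, matching the Met Ala Leu Ser Ser Tyr Thr Asp Asp stretch in SEQ ID NO: 55.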

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..3873

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

ATG CCA ATA ACA ATT AAC AAC TTT AAT TAT TCA GAT CCT GTT GAT AAT 48
Met Pro Ile Thr Ile Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn
1 5 10 15

AAA AAT ATT TTA TAT TTA GAT ACT CAT TTA AAT ACA CTA GCT AAT GAG 96
Lys Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu
20 25 30

CCT GAA AAA GCC TTT CGC ATT ACA GGA AAT ATA TGG GTA ATA CCT GAT 144
Pro Glu Lys Ala Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp
35 40 45

AGA TTT TCA AGA AAT TCT AAT CCA AAT TTA AAT AAA CCT CCT CGA GTT 192
Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val
50 55 60

ACA AGC CCT AAA AGT GGT TAT TAT GAT CCT AAT TAT TTG AGT ACT GAT 240
Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp
65 70 75 80

TCT GAC AAA GAT ACA TTT TTA AAA GAA ATT ATA AAG TTA TTT AAA AGA 288
Ser Asp Lys Asp Thr Phe Leu Lys Glu Ile Ile Lys Leu Phe Lys Arg
85 90 95

ATT AAT TCT AGA GAA ATA GGA GAA GAA TTA ATA TAT AGA CTT TCG ACA 336
Ile Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu Ser Thr
100 105 110

GAT ATA CCC TTT CCT GGG AAT AAC AAT ACT CCA ATT AAT ACT TTT GAT 384
Asp Ile Pro Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp
115 120 125

TTT GAT GTA GAT TTT AAC AGT GTT GAT GTT AAA ACT AGA CAA GGT AAC 432
Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gln Gly Asn
130 135 140

AAC TGG GTT AAA ACT GGT AGC ATA AAT CCT AGT GTT ATA ATA ACT GGA 480
Asn Trp Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly
145 150 155 160

CCT AGA GAA AAC ATT ATA GAT CCA GAA ACT TCT ACG TTT AAA TTA ACT 528
Pro Arg Glu Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr
165 170 175

AAC AAT ACT TTT GCG GCA CAA GAA GGA TTT GGT GCT TTA TCA ATA ATT 576
Asn Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu Ser Ile Ile
180 185 190

TCA ATA TCA CCT AGA TTT ATG CTA ACA TAT AGT AAT GCA ACT AAT GAT 624
Ser Ile Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp
195 200 205

GTA GGA GAG GGT AGA TTT TCT AAG TCT GAA TTT TGC ATG GAT CCA ATA 672
Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro Ile
210 215 220

CTA ATT TTA ATG CAT GAA CTT AAT CAT GCA ATG CAT AAT TTA TAT GGA 720
Leu Ile Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly
225 230 235 240

ATA GCT ATA CCA AAT GAT CAA ACA ATT TCA TCT GTA ACT AGT AAT ATT 768
Ile Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile
245 250 255

TTT TAT TCT CAA TAT AAT GTG AAA TTA GAG TAT GCA GAA ATA TAT GCA 816
Phe Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala
260 265 270

TTT GGA GGT CCA ACT ATA GAC CTT ATT CCT AAA AGT GCA AGG AAA TAT 864
Phe Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr
275 280 285

TTT GAG GAA AAG GCA TTG GAT TAT TAT AGA TCT ATA GCT AAA AGA CTT 912
Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala Lys Arg Leu
290 295 300

AAT AGT ATA ACT ACT GCA AAT CCT TCA AGC TTT AAT AAA TAT ATA GGG 960
Asn Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly
305 310 315 320

GAA TAT AAA CAG AAA CTT ATT AGA AAG TAT AGA TTC GTA GTA GAA TCT 1008
Glu Tyr Lys Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser
325 330 335

TCA GGT GAA GTT ACA GTA AAT CGT AAT AAG TTT GTT GAG TTA TAT AAT 1056
Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn
340 345 350

GAA CTT ACA CAA ATA TTT ACA GAA TTT AAC TAC GCT AAA ATA TAT AAT 1104
Glu Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr Asn
355 360 365

GTA CAA AAT AGG AAA ATA TAT CTT TCA AAT GTA TAT ACT CCG GTT ACG 1152
Val Gln Asn Arg Lys Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr
370 375 380

GCG AAT ATA TTA GAC GAT AAT GTT TAT GAT ATA CAA AAT GGA TTT AAT 1200
Ala Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn Gly Phe Asn
385 390 395 400

ATA CCT AAA AGT AAT TTA AAT GTA CTA TTT ATG GGT CAA AAT TTA TCT 1248
Ile Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser
405 410 415

CGA AAT CCA GCA TTA AGA AAA GTC AAT CCT GAA AAT ATG CTT TAT TTA 1296
Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu
420 425 430

TTT ACA AAA TTT TGT CAT AAA GCA ATA GAT GGT AGA TCA TTA TAT AAT 1344
Phe Thr Lys Phe Cys His Lys Ala Ile Asp Gly Arg Ser Leu Tyr Asn
435 440 445

AAA ACA TTA GAT TGT AGA GAG CTT TTA GTT AAA AAT ACT GAC TTA CCC 1392
Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro
450 455 460

TTT ATA GGT GAT ATT AGT GAT GTT AAA ACT GAT ATA TTT TTA AG A AAA 1440 
Phe lie Gly Asp He Ser Asp Vai Lys Thr Asp He Phe Leu Arg Lys 
465 470 475 480 

GAT ATT AAT GAA GAA ACT GAA GTT ATA TAC TAT CCG GAC AAT GTT TCA 14 88 

Asp He Asn Glu Glu Thr Glu Val He Tyr Tyr Pro Asp Asn Val Ser 
485 490 495 

GTA GAT CAA GTT ATT CTC AGT AAG AAT ACC TCA GAA CAT GGA CAA CTA 1536 
Val Asp Gin Vai He Leu Ser Lys Asn Thr Ser Glu His Giy Gj.n Leu 
500 505 510 

15 GAT TTA TTA TAC CCT AGT ATT GAC AGT GAG AGT GAA ATA TTA CCA GGG 15S4 

Asp Leu Leu Tyr Pro Ser He Asp Ser Glu Ser Glu He Leu Pro Gly 
515 520 525 

GAG AAT CAA GTC TTT TAT GAT AAT AGA ACT CAA AAT GTT GAT TAT TTG 1632 
20 Glu Asn Gin Val Phe Tyr Asp Asn Arg Thr Gin Asn Val Asp Tyr Leu 

530 535 540 

AAT TCT TAT TAT TAC CTA GAA TCT CAA AAA CTA AGT GAT AAT GTT GAA 168 0 

Asn Ser Tyr Tyr Tyr Leu Glu Ser Gin Lys Leu Ser Asp Asn Val Glu 
25 545 550 555 560 

GAT TTT ACT TTT ACG AGA TCA ATT GAG GAG GCT TTG GAT AAT AGT GCA 1728 

Asp Phe Thr Phe Thr Arg Ser He Glu Glu Ala Leu Asp Asn Ser Ala 
565 570 575 

30 

AAA GTA TAT ACT TAC TTT CCT ACA CTA GCT AAT AAA GTA AAT GCG GGT 17 76 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 

35 GTT CAA GGT GGT TTA TTT TTA ATG TGG GCA AAT GAT GTA GTT GAA GAT 1824 

Val Gin Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp 
595 600 605 

TTT ACT ACA AAT ATT CTA AGA AAA GAT ACA TTA GAT AAA ATA TCA GAT 1872 
Phe Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp

610 615 620 

GTA TCA GCT ATT ATT CCC TAT ATA GGA CCC GCA TTA AAT ATA AGT AAT 1920 
Val Ser Ala Ile Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn
625 630 635 640

TCT GTA AGA AGA GGA AAT TTT ACT GAA GCA TTT GCA GTT ACT GGT GTA 1968 
Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val
645 650 655 






ACT ATT TTA TTA GAA GCA TTT CCT GAA TTT ACA ATA CCT GCA CTT GGT 2016

Thr Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly
660 665 670 

GCA TTT GTG ATT TAT AGT AAG GTT CAA GAA AGA AAC GAG ATT ATT AAA 2064 

Ala Phe Val Ile Tyr Ser Lys Val Gln Glu Arg Asn Glu Ile Ile Lys
675 680 685 

ACT ATA GAT AAT TGT TTA GAA CAA AGG ATT AAG AGA TGG AAA GAT TCA 2112 

Thr Ile Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp Ser
690 695 700



TAT GAA TGG ATG ATG GGA ACG TGG TTA TCC AGG ATT ATT ACT CAA TTT 2160

Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe
705 710 715 720



AAT AAT ATA AGT TAT CAA ATG TAT GAT TCT TTA AAT TAT CAG GCA GGT 2208 
Asn Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly
725 730 735 



GCA ATC AAA GCT AAA ATA GAT TTA GAA TAT AAA AAA TAT TCA GGA AGT 2256 

Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser
740 745 750 

GAT AAA GAA AAT ATA AAA AGT CAA GTT GAA AAT TTA AAA AAT AGT TTA 2304

Asp Lys Glu Asn Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu
755 760 765

GAT GTA AAA ATT TCG GAA GCA ATG AAT AAT ATA AAT AAA TTT ATA CGA 2352

Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg
770 775 780 



GAA TGT TCC GTA ACA TAT TTA TTT AAA AAT ATG TTA CCT AAA GTA ATT 2400 
Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile

785 790 795 800



GAT GAA TTA AAT GAG TTT GAT CGA AAT ACT AAA GCA AAA TTA ATT AAT 2448

Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn
805 810 815

CTT ATA GAT AGT CAT AAT ATT ATT CTA GTT GGT GAA GTA GAT AAA TTA 2496

Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val Asp Lys Leu
820 825 830

AAA GCA AAA GTA AAT AAT AGC TTT CAA AAT ACA ATA CCC TTT AAT ATT 2544 
Lys Ala Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn Ile
835 840 845 

TTT TCA TAT ACT AAT AAT TCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 2592 
Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr
850 855 860



TTC AAT AAT ATT AAT GAT TCA AAA ATT TTG AGC CTA CAA AAC AGA AAA 2640

Phe Asn Asn Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Arg Lys
865 870 875 880 



AAT ACT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGT GAA GAA 2688

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu Glu 
885 890 895 

GGC GAT GTT CAG CTT AAT CCA ATA TTT CCA TTT GAC TTT AAA TTA GGT 2736 
Gly Asp Val Gln Leu Asn Pro Ile Phe Pro Phe Asp Phe Lys Leu Gly
900 905 910 

AGT TCA GGG GAG GAT AGA GGT AAA GTT ATA GTA ACC CAG AAT GAA AAT 2784

Ser Ser Gly Glu Asp Arg Gly Lys Val Ile Val Thr Gln Asn Glu Asn
915 920 925 

ATT GTA TAT AAT TCT ATG TAT GAA AGT TTT AGC ATT AGT TTT TGG ATT 2832 
Ile Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser Ile Ser Phe Trp Ile

930 935 940 

AGA ATA AAT AAA TGG GTA AGT AAT TTA CCT GGA TAT ACT ATA ATT GAT 2880 
Arg Ile Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr Ile Ile Asp
945 950 955 960

AGT GTT AAA AAT AAC TCA GGT TGG AGT ATA GGT ATT ATT AGT AAT TTT 2928

Ser Val Lys Asn Asn Ser Gly Trp Ser Ile Gly Ile Ile Ser Asn Phe

965 970 975 


TTA GTA TTT ACT TTA AAA CAA AAT GAA GAT AGT GAA CAA AGT ATA AAT 2976

Leu Val Phe Thr Leu Lys Gln Asn Glu Asp Ser Glu Gln Ser Ile Asn
980 985 990 

TTT AGT TAT GAT ATA TCA AAT AAT GCT CCT GGA TAC AAT AAA TGG TTT 3024

Phe Ser Tyr Asp Ile Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe
995 1000 1005 

TTT GTA ACT GTT ACT AAC AAT ATG ATG GGA AAT ATG AAG ATT TAT ATA 3072

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys Ile Tyr Ile
1010 1015 1020

AAT GGA AAA TTA ATA GAT ACT ATA AAA GTT AAA GAA CTA ACT GGA ATT 3120 
Asn Gly Lys Leu Ile Asp Thr Ile Lys Val Lys Glu Leu Thr Gly Ile
1025 1030 1035 1040 

AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA GAT ACC 3168
Asn Phe Ser Lys Thr Ile Thr Phe Glu Ile Asn Lys Ile Pro Asp Thr
1045 1050 1055 

GGT TTG ATT ACT TCA GAT TCT GAT AAC ATC AAT ATG TGG ATA AGA GAT 3216 
Gly Leu Ile Thr Ser Asp Ser Asp Asn Ile Asn Met Trp Ile Arg Asp
1060 1065 1070 

TTT TAT ATA TTT GCT AAA GAA TTA GAT GGT AAA GAT ATT AAT ATA TTA 3264

Phe Tyr Ile Phe Ala Lys Glu Leu Asp Gly Lys Asp Ile Asn Ile Leu
1075 1080 1085

TTT AAT AGC TTG CAA TAT ACT AAT GTT GTA AAA GAT TAT TGG GGA AAT 3312 
Phe Asn Ser Leu Gln Tyr Thr Asn Val Val Lys Asp Tyr Trp Gly Asn

1090 1095 1100 

GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 3360

Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn Ile Asp Tyr Leu
1105 1110 1115 1120

AAT AGA TAT ATG TAT GCG AAC TCA CGA CAA ATT GTT TTT AAT ACA CGT 3408

Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gln Ile Val Phe Asn Thr Arg
1125 1130 1135


AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA AGA 3456

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys Ile Ile Ile Lys Arg
1140 1145 1150 

ATC AGA GGA AAT ACA AAT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 3504

Ile Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp Ile Leu
1155 1160 1165 

TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT ATG AAG 3552

Tyr Phe Asp Met Thr Ile Asn Asn Lys Ala Tyr Asn Leu Phe Met Lys

1170 1175 1180

AAT GAA ACT ATG TAT GCA GAT AAT CAT AGT ACT GAA GAT ATA TAT GCT 3600 
Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp Ile Tyr Ala
1185 1190 1195 1200

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 3648 
Ile Gly Leu Arg Glu Gln Thr Lys Asp Ile Asn Asp Asn Ile Ile Phe
1205 1210 1215 


CAA ATA CAA CCA ATG AAT AAT ACT TAT TAT TAC GCA TCT CAA ATA TTT 3696

Gln Ile Gln Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gln Ile Phe
1220 1225 1230 

AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TGT TCA ATA GGT 3744

Lys Ser Asn Phe Asn Gly Glu Asn Ile Ser Gly Ile Cys Ser Ile Gly
1235 1240 1245 

ACT TAT CGT TTT AGA CTT GGA GGT GAT TGG TAT AGA CAC AAT TAT TTG 3792 
Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu

1250 1255 1260

GTG CCT ACT GTG AAG CAA GGA AAT TAT GCT TCA TTA TTA GAA TCA ACA 3840

Val Pro Thr Val Lys Gln Gly Asn Tyr Ala Ser Leu Leu Glu Ser Thr
1265 1270 1275 1280

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAA 3876

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
1285 1290 

(2) INFORMATION FOR SEQ ID NO:60:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:

Met Pro Ile Thr Ile Asn Asn Phe Asn Tyr Ser Asp Pro Val Asp Asn
1 5 10 15

Lys Asn Ile Leu Tyr Leu Asp Thr His Leu Asn Thr Leu Ala Asn Glu
20 25 30

Pro Glu Lys Ala Phe Arg Ile Thr Gly Asn Ile Trp Val Ile Pro Asp
35 40 45

Arg Phe Ser Arg Asn Ser Asn Pro Asn Leu Asn Lys Pro Pro Arg Val
50 55 60

Thr Ser Pro Lys Ser Gly Tyr Tyr Asp Pro Asn Tyr Leu Ser Thr Asp
65 70 75 80

Ser Asp Lys Asp Thr Phe Leu Lys Glu Ile Ile Lys Leu Phe Lys Arg
85 90 95

Ile Asn Ser Arg Glu Ile Gly Glu Glu Leu Ile Tyr Arg Leu Ser Thr
100 105 110

Asp Ile Pro Phe Pro Gly Asn Asn Asn Thr Pro Ile Asn Thr Phe Asp
115 120 125

Phe Asp Val Asp Phe Asn Ser Val Asp Val Lys Thr Arg Gln Gly Asn
130 135 140

Asn Trp Val Lys Thr Gly Ser Ile Asn Pro Ser Val Ile Ile Thr Gly
145 150 155 160

Pro Arg Glu Asn Ile Ile Asp Pro Glu Thr Ser Thr Phe Lys Leu Thr
165 170 175

Asn Asn Thr Phe Ala Ala Gln Glu Gly Phe Gly Ala Leu Ser Ile Ile
180 185 190

Ser Ile Ser Pro Arg Phe Met Leu Thr Tyr Ser Asn Ala Thr Asn Asp
195 200 205

Val Gly Glu Gly Arg Phe Ser Lys Ser Glu Phe Cys Met Asp Pro Ile
210 215 220

Leu Ile Leu Met His Glu Leu Asn His Ala Met His Asn Leu Tyr Gly
225 230 235 240

Ile Ala Ile Pro Asn Asp Gln Thr Ile Ser Ser Val Thr Ser Asn Ile
245 250 255

Phe Tyr Ser Gln Tyr Asn Val Lys Leu Glu Tyr Ala Glu Ile Tyr Ala
260 265 270

Phe Gly Gly Pro Thr Ile Asp Leu Ile Pro Lys Ser Ala Arg Lys Tyr
275 280 285

Phe Glu Glu Lys Ala Leu Asp Tyr Tyr Arg Ser Ile Ala Lys Arg Leu
290 295 300

Asn Ser Ile Thr Thr Ala Asn Pro Ser Ser Phe Asn Lys Tyr Ile Gly
305 310 315 320 
Glu Tyr Lys Gln Lys Leu Ile Arg Lys Tyr Arg Phe Val Val Glu Ser
325 330 335

Ser Gly Glu Val Thr Val Asn Arg Asn Lys Phe Val Glu Leu Tyr Asn
340 345 350

Glu Leu Thr Gln Ile Phe Thr Glu Phe Asn Tyr Ala Lys Ile Tyr Asn
355 360 365

Val Gln Asn Arg Lys Ile Tyr Leu Ser Asn Val Tyr Thr Pro Val Thr
370 375 380

Ala Asn Ile Leu Asp Asp Asn Val Tyr Asp Ile Gln Asn Gly Phe Asn
385 390 395 400

Ile Pro Lys Ser Asn Leu Asn Val Leu Phe Met Gly Gln Asn Leu Ser
405 410 415 

Arg Asn Pro Ala Leu Arg Lys Val Asn Pro Glu Asn Met Leu Tyr Leu 
420 425 430 

Phe Thr Lys Phe Cys His Lys Ala Ile Asp Gly Arg Ser Leu Tyr Asn
435 440 445

Lys Thr Leu Asp Cys Arg Glu Leu Leu Val Lys Asn Thr Asp Leu Pro
450 455 460

Phe Ile Gly Asp Ile Ser Asp Val Lys Thr Asp Ile Phe Leu Arg Lys
465 470 475 480

Asp Ile Asn Glu Glu Thr Glu Val Ile Tyr Tyr Pro Asp Asn Val Ser
485 490 495

Val Asp Gln Val Ile Leu Ser Lys Asn Thr Ser Glu His Gly Gln Leu
500 505 510

Asp Leu Leu Tyr Pro Ser Ile Asp Ser Glu Ser Glu Ile Leu Pro Gly
515 520 525

Glu Asn Gln Val Phe Tyr Asp Asn Arg Thr Gln Asn Val Asp Tyr Leu
530 535 540

Asn Ser Tyr Tyr Tyr Leu Glu Ser Gln Lys Leu Ser Asp Asn Val Glu
545 550 555 560

Asp Phe Thr Phe Thr Arg Ser Ile Glu Glu Ala Leu Asp Asn Ser Ala
565 570 575 

Lys Val Tyr Thr Tyr Phe Pro Thr Leu Ala Asn Lys Val Asn Ala Gly 
580 585 590 

Val Gln Gly Gly Leu Phe Leu Met Trp Ala Asn Asp Val Val Glu Asp
595 600 605

Phe Thr Thr Asn Ile Leu Arg Lys Asp Thr Leu Asp Lys Ile Ser Asp
610 615 620

Val Ser Ala Ile Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Ser Asn
625 630 635 640

Ser Val Arg Arg Gly Asn Phe Thr Glu Ala Phe Ala Val Thr Gly Val
645 650 655

Thr Ile Leu Leu Glu Ala Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly
660 665 670

Ala Phe Val Ile Tyr Ser Lys Val Gln Glu Arg Asn Glu Ile Ile Lys
675 680 685

Thr Ile Asp Asn Cys Leu Glu Gln Arg Ile Lys Arg Trp Lys Asp Ser
690 695 700



Tyr Glu Trp Met Met Gly Thr Trp Leu Ser Arg Ile Ile Thr Gln Phe
705 710 715 720

Asn Asn Ile Ser Tyr Gln Met Tyr Asp Ser Leu Asn Tyr Gln Ala Gly
725 730 735

Ala Ile Lys Ala Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser
740 745 750

Asp Lys Glu Asn Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu
755 760 765 

Asp Val Lys Ile Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg
770 775 780



Glu Cys Ser Val Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile
785 790 795 800

Asp Glu Leu Asn Glu Phe Asp Arg Asn Thr Lys Ala Lys Leu Ile Asn
805 810 815

Leu Ile Asp Ser His Asn Ile Ile Leu Val Gly Glu Val Asp Lys Leu
820 825 830

Lys Ala Lys Val Asn Asn Ser Phe Gln Asn Thr Ile Pro Phe Asn Ile
835 840 845

Phe Ser Tyr Thr Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr
850 855 860

Phe Asn Asn Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Arg Lys
865 870 875 880

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu Glu
885 890 895

Gly Asp Val Gln Leu Asn Pro Ile Phe Pro Phe Asp Phe Lys Leu Gly
900 905 910

Ser Ser Gly Glu Asp Arg Gly Lys Val Ile Val Thr Gln Asn Glu Asn
915 920 925

Ile Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser Ile Ser Phe Trp Ile
930 935 940

Arg Ile Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr Ile Ile Asp
945 950 955 960

Ser Val Lys Asn Asn Ser Gly Trp Ser Ile Gly Ile Ile Ser Asn Phe
965 970 975

Leu Val Phe Thr Leu Lys Gln Asn Glu Asp Ser Glu Gln Ser Ile Asn
980 985 990



Phe Ser Tyr Asp Ile Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe
995 1000 1005

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys Ile Tyr Ile
1010 1015 1020

Asn Gly Lys Leu Ile Asp Thr Ile Lys Val Lys Glu Leu Thr Gly Ile
1025 1030 1035 1040

Asn Phe Ser Lys Thr Ile Thr Phe Glu Ile Asn Lys Ile Pro Asp Thr
1045 1050 1055

Gly Leu Ile Thr Ser Asp Ser Asp Asn Ile Asn Met Trp Ile Arg Asp
1060 1065 1070 






WO 98/08540

PCT/US97/15394







20 25 30 35 

TTC AAT AAT ATT AAT GAT TCA AAA ATT TTG AGC CTA CAA AAC AGA AAA 260 
Phe Asn Asn Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Arg Lys
40 45 50 

AAT ACT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGT GAA GAA 308

Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Ser Glu Glu
55 60 65 

GGC GAT GTT CAG CTT AAT CCA ATA TTT CCA TTT GAC TTT AAA TTA GGT 356

Gly Asp Val Gln Leu Asn Pro Ile Phe Pro Phe Asp Phe Lys Leu Gly
70 75 80 

AGT TCA GGG GAG GAT AGA GGT AAA GTT ATA GTA ACC CAG AAT GAA AAT 404

Ser Ser Gly Glu Asp Arg Gly Lys Val Ile Val Thr Gln Asn Glu Asn
85 90 95 

ATT GTA TAT AAT TCT ATG TAT GAA AGT TTT AGC ATT AGT TTT TGG ATT 452

Ile Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser Ile Ser Phe Trp Ile

100 105 110 115

AGA ATA AAT AAA TGG GTA AGT AAT TTA CCT GGA TAT ACT ATA ATT GAT 500 
Arg Ile Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr Ile Ile Asp
120 125 130

AGT GTT AAA AAT AAC TCA GGT TGG AGT ATA GGT ATT ATT AGT AAT TTT 548

Ser Val Lys Asn Asn Ser Gly Trp Ser Ile Gly Ile Ile Ser Asn Phe
135 140 145 


TTA GTA TTT ACT TTA AAA CAA AAT GAA GAT AGT GAA CAA AGT ATA AAT 596

Leu Val Phe Thr Leu Lys Gln Asn Glu Asp Ser Glu Gln Ser Ile Asn
150 155 160 

TTT AGT TAT GAT ATA TCA AAT AAT GCT CCT GGA TAC AAT AAA TGG TTT 644

Phe Ser Tyr Asp Ile Ser Asn Asn Ala Pro Gly Tyr Asn Lys Trp Phe
165 170 175 

TTT GTA ACT GTT ACT AAC AAT ATG ATG GGA AAT ATG AAG ATT TAT ATA 692

Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys Ile Tyr Ile

180 185 190 195 

AAT GGA AAA TTA ATA GAT ACT ATA AAA GTT AAA GAA CTA ACT GGA ATT 740

Asn Gly Lys Leu Ile Asp Thr Ile Lys Val Lys Glu Leu Thr Gly Ile
200 205 210

AAT TTT AGC AAA ACT ATA ACA TTT GAA ATA AAT AAA ATT CCA GAT ACC 788

Asn Phe Ser Lys Thr Ile Thr Phe Glu Ile Asn Lys Ile Pro Asp Thr

215 220 225 


GGT TTG ATT ACT TCA GAT TCT GAT AAC ATC AAT ATG TGG ATA AGA GAT 836

Gly Leu Ile Thr Ser Asp Ser Asp Asn Ile Asn Met Trp Ile Arg Asp
230 235 240 

TTT TAT ATA TTT GCT AAA GAA TTA GAT GGT AAA GAT ATT AAT ATA TTA 884

Phe Tyr Ile Phe Ala Lys Glu Leu Asp Gly Lys Asp Ile Asn Ile Leu
245 250 255 

TTT AAT AGC TTG CAA TAT ACT AAT GTT GTA AAA GAT TAT TGG GGA AAT 932 
Phe Asn Ser Leu Gln Tyr Thr Asn Val Val Lys Asp Tyr Trp Gly Asn

260 265 270 275 

GAT TTA AGA TAT AAT AAA GAA TAT TAT ATG GTT AAT ATA GAT TAT TTA 980 
Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn Ile Asp Tyr Leu
280 285 290

AAT AGA TAT ATG TAT GCG AAC TCA CGA CAA ATT GTT TTT AAT ACA CGT 1028 

Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gln Ile Val Phe Asn Thr Arg
295 300 305 

AGA AAT AAT AAT GAC TTC AAT GAA GGA TAT AAA ATT ATA ATA AAA AGA 1076

Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys Ile Ile Ile Lys Arg

310 315 320 

ATC AGA GGA AAT ACA AAT GAT ACT AGA GTA CGA GGA GGA GAT ATT TTA 1124

Ile Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly Asp Ile Leu
325 330 335 

TAT TTT GAT ATG ACA ATT AAT AAC AAA GCA TAT AAT TTG TTT ATG AAG 1172

Tyr Phe Asp Met Thr Ile Asn Asn Lys Ala Tyr Asn Leu Phe Met Lys

340 345 350 355 

AAT GAA ACT ATG TAT GCA GAT AAT CAT AGT ACT GAA GAT ATA TAT GCT 1220 

Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp Ile Tyr Ala
360 365 370

ATA GGT TTA AGA GAA CAA ACA AAG GAT ATA AAT GAT AAT ATT ATA TTT 1268 

Ile Gly Leu Arg Glu Gln Thr Lys Asp Ile Asn Asp Asn Ile Ile Phe
375 380 385 


CAA ATA CAA CCA ATG AAT AAT ACT TAT TAT TAC GCA TCT CAA ATA TTT 1316 

Gln Ile Gln Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser Gln Ile Phe

390 395 400 

AAA TCA AAT TTT AAT GGA GAA AAT ATT TCT GGA ATA TGT TCA ATA GGT 1364

Lys Ser Asn Phe Asn Gly Glu Asn Ile Ser Gly Ile Cys Ser Ile Gly
405 410 415 

ACT TAT CGT TTT AGA CTT GGA GGT GAT TGG TAT AGA CAC AAT TAT TTG 1412 

Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His Asn Tyr Leu

420 425 430 435

GTG CCT ACT GTG AAG CAA GGA AAT TAT GCT TCA TTA TTA GAA TCA ACA 1460

Val Pro Thr Val Lys Gln Gly Asn Tyr Ala Ser Leu Leu Glu Ser Thr
440 445 450

TCA ACT CAT TGG GGT TTT GTA CCT GTA AGT GAA TAAAAGCTT 1502 

Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
455 460 






(2) INFORMATION FOR SEQ ID NO:62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 462 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp Ile Ile
20 25 30

Asn Glu Tyr Phe Asn Asn Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln
35 40 45

Asn Arg Lys Asn Thr Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val
50 55 60

Ser Glu Glu Gly Asp Val Gln Leu Asn Pro Ile Phe Pro Phe Asp Phe
65 70 75 80

Lys Leu Gly Ser Ser Gly Glu Asp Arg Gly Lys Val Ile Val Thr Gln
85 90 95

Asn Glu Asn Ile Val Tyr Asn Ser Met Tyr Glu Ser Phe Ser Ile Ser
100 105 110

Phe Trp Ile Arg Ile Asn Lys Trp Val Ser Asn Leu Pro Gly Tyr Thr
115 120 125

Ile Ile Asp Ser Val Lys Asn Asn Ser Gly Trp Ser Ile Gly Ile Ile
130 135 140

Ser Asn Phe Leu Val Phe Thr Leu Lys Gln Asn Glu Asp Ser Glu Gln
145 150 155 160

Ser Ile Asn Phe Ser Tyr Asp Ile Ser Asn Asn Ala Pro Gly Tyr Asn
165 170 175

Lys Trp Phe Phe Val Thr Val Thr Asn Asn Met Met Gly Asn Met Lys
180 185 190

Ile Tyr Ile Asn Gly Lys Leu Ile Asp Thr Ile Lys Val Lys Glu Leu
195 200 205

Thr Gly Ile Asn Phe Ser Lys Thr Ile Thr Phe Glu Ile Asn Lys Ile
210 215 220

Pro Asp Thr Gly Leu Ile Thr Ser Asp Ser Asp Asn Ile Asn Met Trp
225 230 235 240

Ile Arg Asp Phe Tyr Ile Phe Ala Lys Glu Leu Asp Gly Lys Asp Ile
245 250 255

Asn Ile Leu Phe Asn Ser Leu Gln Tyr Thr Asn Val Val Lys Asp Tyr
260 265 270

Trp Gly Asn Asp Leu Arg Tyr Asn Lys Glu Tyr Tyr Met Val Asn Ile
275 280 285

Asp Tyr Leu Asn Arg Tyr Met Tyr Ala Asn Ser Arg Gln Ile Val Phe
290 295 300

Asn Thr Arg Arg Asn Asn Asn Asp Phe Asn Glu Gly Tyr Lys Ile Ile
305 310 315 320

Ile Lys Arg Ile Arg Gly Asn Thr Asn Asp Thr Arg Val Arg Gly Gly
325 330 335

Asp Ile Leu Tyr Phe Asp Met Thr Ile Asn Asn Lys Ala Tyr Asn Leu
340 345 350

Phe Met Lys Asn Glu Thr Met Tyr Ala Asp Asn His Ser Thr Glu Asp
355 360 365

Ile Tyr Ala Ile Gly Leu Arg Glu Gln Thr Lys Asp Ile Asn Asp Asn
370 375 380

Ile Ile Phe Gln Ile Gln Pro Met Asn Asn Thr Tyr Tyr Tyr Ala Ser
385 390 395 400

Gln Ile Phe Lys Ser Asn Phe Asn Gly Glu Asn Ile Ser Gly Ile Cys
405 410 415

Ser Ile Gly Thr Tyr Arg Phe Arg Leu Gly Gly Asp Trp Tyr Arg His
420 425 430

Asn Tyr Leu Val Pro Thr Val Lys Gln Gly Asn Tyr Ala Ser Leu Leu
435 440 445 

Glu Ser Thr Ser Thr His Trp Gly Phe Val Pro Val Ser Glu 
450 455 460 
(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:

CGCCATGGCT TTATTAAAAG ATATAATTAA TG 32

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

GCAAGCTTTT ATTCACTTAC AGGTACAAAA CC 32 
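
The two 32-base oligonucleotides above (SEQ ID NO:63 and SEQ ID NO:64) can be checked mechanically. The following is a minimal sketch in Python, not part of the listing: it confirms the stated lengths, locates two hexamers, and forms the reverse complement of SEQ ID NO:64. The enzyme names attached to the hexamers (NcoI for CCATGG, HindIII for AAGCTT) are an assumption of this sketch; the listing itself names no restriction enzymes. The helper names `reverse_complement` and `find_sites` are hypothetical.

```python
# Oligonucleotide sequences copied from SEQ ID NO:63 and SEQ ID NO:64,
# with the spacing between blocks of ten bases removed.
PRIMER_63 = "CGCCATGGCTTTATTAAAAGATATAATTAATG"
PRIMER_64 = "GCAAGCTTTTATTCACTTACAGGTACAAAACC"

# Assumption of this sketch: the enzyme names are not given in the listing.
SITES = {"NcoI": "CCATGG", "HindIII": "AAGCTT"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(seq: str) -> dict:
    """Map each site name to the 0-based index of its first occurrence."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    for label, primer in (("SEQ ID NO:63", PRIMER_63), ("SEQ ID NO:64", PRIMER_64)):
        print(label, len(primer), find_sites(primer))
    # The reverse complement of SEQ ID NO:64 reads in the coding direction.
    print(reverse_complement(PRIMER_64))
```

Read in the coding direction, the reverse complement of SEQ ID NO:64 ends in GTA AGT GAA TAA AAGCTT followed by two extra bases, which agrees with the 3' end of the coding sequence shown for the preceding fragment (... GTA AGT GAA TAAAAGCTT).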

(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3831 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 1..3828

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:

ATG ACA TGG CCA GTA AAA GAT TTT AAT TAT AGT GAT CCT GTT AAT GAG 48

Met Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn Asp
1 5 10 15

AAT GAT ATA TTA TAT TTA AGA ATA CCA CAA AAT AAG TTA ATT ACT ACA 96
Asn Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr

20 25 30 

CCT GTA AAA GCT TTT ATG ATT ACT CAA AAT ATT TGG GTA ATA CCA GAA 144

Pro Val Lys Ala Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu
35 40 45

AGA TTT TCA TCA GAT ACT AAT CCA AGT TTA AGT AAA CCG CCC AGA CCT 192 
Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro 
50 55 60 






ACT TCA AAG TAT CAA AGT TAT TAT GAT CCT AGT TAT TTA TCT ACT GAT 240

Thr Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp
65 70 75 80 



GAA CAA AAA GAT ACA TTT TTA AAA GGG ATT ATA AAA TTA TTT AAA AGA 288

Glu Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys Arg
85 90 95 






ATT AAT GAA AGA GAT ATA GGA AAA AAA TTA ATA AAT TAT TTA GTA GTT 336

Ile Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val
100 105 110

GGT TCA CCT TTT ATG GGA GAT TCA AGT ACG CCT GAA GAT ACA TTT GAT 384

Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp 
115 120 125 

TTT ACA CGT CAT ACT ACT AAT ATT GCA GTT GAA AAG TTT GAA AAT GGT 432 
Phe Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly
130 135 140 

AGT TGG AAA GTA ACA AAT ATT ATA ACA CCA AGT GTA TTG ATA TTT GGA 480

Ser Trp Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly
145 150 155 160 

CCA CTT CCT AAT ATA TTA GAC TAT ACA GCA TCC CTT ACA TTG CAA GGA 528

Pro Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gln Gly
165 170 175 

CAA CAA TCA AAT CCA TCA TTT GAA GGG TTT GGA ACA TTA TCT ATA CTA 576 
Gln Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu

180 185 190

AAA GTA GCA CCT GAA TTT TTG TTA ACA TTT AGT GAT GTA ACA TCT AAT 624 
Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn 
195 200 205

CAA AGT TCA GCT GTA TTA GGC AAA TCT ATA TTT TGT ATG GAT CCA GTA 672

Gln Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro Val

210 215 220 


ATA GCT TTA ATG CAT GAG TTA ACA CAT TCT TTG CAT CAA TTA TAT GGA 720

Ile Ala Leu Met His Glu Leu Thr His Ser Leu His Gln Leu Tyr Gly

225 230 235 240 

ATA AAT ATA CCA TCT GAT AAA AGG ATT CGT CCA CAA GTT AGC GAG GGA 768

Ile Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly
245 250 255 

TTT TTC TCT CAA GAT GGA CCC AAC GTA CAA TTT GAG GAA TTA TAT ACA 816 
Phe Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr

260 265 270 

TTT GGA GGA TTA GAT GTT GAA ATA ATA CCT CAA ATT GAA AGA TCA CAA 864 
Phe Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln
275 280 285

TTA AGA GAA AAA GCA TTA GGT CAC TAT AAA GAT ATA GCG AAA AGA CTT 912 

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala Lys Arg Leu

290 295 300 


AAT AAT ATT AAT AAA ACT ATT CCT TCT AGT TGG ATT AGT AAT ATA GAT 960 

Asn Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp

305 310 315 320 

AAA TAT AAA AAA ATA TTT TCT GAA AAG TAT AAT TTT GAT AAA GAT AAT 1008

Lys Tyr Lys Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn
325 330 335 

ACA GGA AAT TTT GTT GTA AAT ATT GAT AAA TTC AAT AGC TTA TAT TCA 1056 
Thr Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser Leu Tyr Ser

340 345 350 
GAC TTG ACT AAT GTT ATG TCA GAA GTT GTT TAT TCT TCG CAA TAT AAT
Asp Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn
355 360 365 

GTT AAA AAC AGG ACT CAT TAT TTT TCA AGG CAT TAT CTA CCT GTA TTT 
Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe 
370 375 380 

GCA AAT ATA TTA GAT GAT AAT ATT TAT ACT ATA AGA GAT GGT TTT AAT 
Ala Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn
385 390 395 400

TTA ACA AAT AAA GGT TTT AAT ATA GAA AAT TCG GGT CAG AAT ATA GAA 
Leu Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu
405 410 415 

AGG AAT CCT GCA CTA CAA AAG CTT AGT TCA GAA AGT GTA GTA GAT TTA 
Arg Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu Ser Val Val Asp Leu
420 425 430 

TTT ACA AAA GTA TGT TTA AGA TTA ACA AAA AAT AGT AGA GAT GAT TCA 
Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser 
435 440 445 

ACA TGT ATT AAA GTT AAA AAT AAT AGA TTA CCT TAT GTA GCT GAT AAA 
Thr Cys Ile Lys Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys
450 455 460 

GAT AGC ATT TCA CAA GAA ATA TTT GAA AAT AAA ATT ATT ACA GAT GAG 
Asp Ser Ile Ser Gln Glu Ile Phe Glu Asn Lys Ile Ile Thr Asp Glu
465 470 475 480 

ACT AAT GTA CAA AAT TAT TCA GAT AAT TTT TCA TTA GAT GAA TCT ATT 
Thr Asn Val Gln Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser Ile
485 490 495

TTA GAT GGG CAA GTT CCT ATT AAT CCT GAA ATA GTA GAT CCA CTA TTA 
Leu Asp Gly Gln Val Pro Ile Asn Pro Glu Ile Val Asp Pro Leu Leu
500 505 510 

CCC AAT GTT AAT ATG GAA CCT TTA AAT CTT CCA GGT GAA GAA ATA GTA 
Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu Ile Val
515 520 525 

TTT TAT GAT GAT ATT ACT AAA TAT GTT GAT TAT TTA AAT TCT TAT TAT 
Phe Tyr Asp Asp Ile Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr
530 535 540 

TAT TTG GAA TCT CAA AAA TTA AGT AAT AAT GTT GAA AAT ATT ACT CTT 
Tyr Leu Glu Ser Gln Lys Leu Ser Asn Asn Val Glu Asn Ile Thr Leu
545 550 555 560 

ACA ACT TCA GTT GAA GAA GCA TTA GGT TAT AGC AAT AAG ATA TAC ACA 
Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys Ile Tyr Thr
565 570 575 

TTT TTA CCT AGC TTA GCT GAA AAA GTG AAT AAA GGT GTT CAA GCA GGT 
Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gln Ala Gly
580 585 590 

TTA TTC TTA AAT TGG GCG AAT GAA GTA GTT GAG GAT TTT ACT ACA AAT 
Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn 
595 600 605 

ATT ATG AAG AAA GAT ACA TTG GAT AAA ATA TCA GAT GTA TCA GTA ATA 
Ile Met Lys Lys Asp Thr Leu Asp Lys Ile Ser Asp Val Ser Val Ile
610 615 620 

ATT CCA TAT ATA GGA CCT GCC TTA AAT ATA GGA AAT TCA GCA TTA AGG 
Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Gly Asn Ser Ala Leu Arg
625 630 635 640

GGA AAT TTT AAG CAA GCA TTT GCA ACA GCT GGT GTA GCT TTT TTA TTA 1968

Gly Asn Phe Lys Gln Ala Phe Ala Thr Ala Gly Val Ala Phe Leu Leu
645 650 655 

GAG GGA TTT CCA GAG TTT ACT ATA CCT GCA CTC GGT GTA TTT ACC TTT 2016
Glu Gly Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Val Phe Thr Phe
660 665 670 

TAT AGT TCT ATT CAA GAA AGA GAG AAA ATT ATT AAA ACT ATA GAA AAT 2064 
Tyr Ser Ser Ile Gln Glu Arg Glu Lys Ile Ile Lys Thr Ile Glu Asn
675 680 685

TGT TTG GAA CAA AGA GTT AAG AGA TGG AAA GAT TCA TAT CAA TGG ATG 2112

Cys Leu Glu Gln Arg Val Lys Arg Trp Lys Asp Ser Tyr Gln Trp Met
690 695 700 

GTA TCA AAT TGG TTG TCA AGA ATT ACT ACT CAA TTT AAT CAT ATA AAT 2160 
Val Ser Asn Trp Leu Ser Arg Ile Thr Thr Gln Phe Asn His Ile Asn

705 710 715 720 

TAT CAA ATG TAT GAT TCT TTA AGT TAT CAG GCA GAT GCA ATC AAA GCT 2208 
Tyr Gln Met Tyr Asp Ser Leu Ser Tyr Gln Ala Asp Ala Ile Lys Ala
725 730 735

AAA ATA GAT TTA GAA TAT AAA AAA TAC TCA GGA AGT GAT AAA GAA AAT 2256 

Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu Asn

740 745 750 


ATA AAA AGT CAA GTT GAA AAT TTA AAA AAT AGT TTA GAT GTA AAA ATT 2304

Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys Ile

755 760 765 

TCG GAA GCA ATG AAT AAT ATA AAT AAA TTT ATA CGA GAA TGT TCT GTA 2352

Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg Glu Cys Ser Val
770 775 780 

ACA TAC TTA TTT AAA AAT ATG CTC CCT AAA GTA ATT GAG GAA TTA AAT 2400

Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile Asp Glu Leu Asn

785 790 795 800

AAG TTT GAT TTA AGA ACT AAA ACA GAA TTA ATT AAT CTT ATA GAT AGT 2448

Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu Ile Asn Leu Ile Asp Ser
805 810 815

CAT AAT ATT ATT CTA GTT GGT GAA GTA GAT AGA TTA AAA GCA AAA GTA 2496

His Asn Ile Ile Leu Val Gly Glu Val Asp Arg Leu Lys Ala Lys Val
820 825 830 


AAT GAG AGT TTT GAA AAT ACA ATG CCT TTT AAT ATT TTT TCA TAT ACT 2544

Asn Glu Ser Phe Glu Asn Thr Met Pro Phe Asn Ile Phe Ser Tyr Thr
835 840 845 

AAT AAT TCT TTA TTA AAA GAT ATA ATT AAT GAA TAT TTC AAT AGT ATT 2592

Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr Phe Asn Ser Ile
850 855 860 

AAT GAT TCA AAA ATT TTG AGC TTA CAA AAC AAA AAA AAT GCT TTA GTG 2640 
Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Lys Lys Asn Ala Leu Val

865 870 875 880 

GAT ACA TCA GGA TAT AAT GCA GAA GTG AGG GTA GGA GAT AAT GTT CAA 2688

Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly Asp Asn Val Gln
885 890 895

CTT AAT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT AGT TCA GGA GAT 2736

Leu Asn Thr Ile Tyr Thr Asn Asp Phe Lys Leu Ser Ser Ser Gly Asp
900 905 910 

AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT AGC GCT ATT TAT 2784

Lys Ile Ile Val Asn Leu Asn Asn Asn Ile Leu Tyr Ser Ala Ile Tyr
915 920 925 

GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT AAA GAT TTA ACT 2832

Glu Asn Ser Ser Val Ser Phe Trp Ile Lys Ile Ser Lys Asp Leu Thr
930 935 940 

AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA GAA CAA AAT TCT 2880

Asn Ser His Asn Glu Tyr Thr Ile Ile Asn Ser Ile Glu Gln Asn Ser

945 950 955 960 

GGG TGG AAA TTA TGT ATT AGG AAT GGC AAT ATA GAA TGG ATT TTA CAA 2928

Gly Trp Lys Leu Cys Ile Arg Asn Gly Asn Ile Glu Trp Ile Leu Gln
965 970 975

GAT GTT AAT AGA AAG TAT AAA AGT TTA ATT TTT GAT TAT AGT GAA TCA 2976 
Asp Val Asn Arg Lys Tyr Lys Ser Leu Ile Phe Asp Tyr Ser Glu Ser
980 985 990 


TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT GTT ACT ATA ACT 3024 
Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr Ile Thr
995 1000 1005

AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT GGA GAA TTA AAG 3072

Asn Asn Ile Met Gly Tyr Met Lys Leu Tyr Ile Asn Gly Glu Leu Lys
1010 1015 1020

CAG AGT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG TTA GAT AAA ACC 3120 
Gln Ser Gln Lys Ile Glu Asp Leu Asp Glu Val Lys Leu Asp Lys Thr

1025 1030 1035 1040 

ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT CAG ATG CTT TGG 3168 
Ile Val Phe Gly Ile Asp Glu Asn Ile Asp Glu Asn Gln Met Leu Trp
1045 1050 1055

ATT AGA GAT TTT AAT ATT TTT TCT AAA GAA TTA AGT AAT GAA GAT ATT 3216 

Ile Arg Asp Phe Asn Ile Phe Ser Lys Glu Leu Ser Asn Glu Asp Ile
1060 1065 1070 


AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT ATT AAA GAT TAT 3264 

Asn Ile Val Tyr Glu Gly Gln Ile Leu Arg Asn Val Ile Lys Asp Tyr
1075 1080 1085 

TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT ATT ATT AAT GAT 3312

Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr Ile Ile Asn Asp
1090 1095 1100

AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT GTA CTT GTA CTT 3360

Asn Tyr Ile Asp Arg Tyr Ile Ala Pro Glu Ser Asn Val Leu Val Leu

1105 1110 1115 1120 

GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA AAT CCT ATT ACT 3408

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro Ile Thr
1125 1130 1135

ATT AAA TCA GTA TCT GAT AAG AAT CCT TAT AGT AGA ATT TTA AAT GGA 3456

Ile Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg Ile Leu Asn Gly
1140 1145 1150 






GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG AAA TAT ATG ATA 3504

Asp Asn Ile Ile Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met Ile
1155 1160 1165 



ATA AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA GGA GAG TGT TCA 3552

Ile Arg Asp Thr Asp Thr Ile Tyr Ala Thr Gln Gly Gly Glu Cys Ser
1170 1175 1180 

CAA AAT TGT GTA TAT GCA TTA AAA TTA CAG AGT AAT TTA GGT AAT TAT 3600 
Gln Asn Cys Val Tyr Ala Leu Lys Leu Gln Ser Asn Leu Gly Asn Tyr
1185 1190 1195 1200

GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT GTA TCT AAA AAT AAA TAT 3648

Gly Ile Gly Ile Phe Ser Ile Lys Asn Ile Val Ser Lys Asn Lys Tyr
1205 1210 1215

TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA ATG CTT CTA GCA 3696

Cys Ser Gln Ile Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala
1220 1225 1230 

GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT GCA TAC ACG CCA 3 744 

Asp He Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro 
1235 1240 1245 



15 GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA ACT TCA TCT TTT 3 792 

Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe 
1250 1255 1260 

TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG TAA 33 31 

20 Trp Lys Phe He Ser Arg Asp Pro Gly Trp Val Glu 

1265 1270 1275 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1276 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

Met Thr Trp Pro Val Lys Asp Phe Asn Tyr Ser Asp Pro Val Asn Asp
1 5 10 15

Asn Asp Ile Leu Tyr Leu Arg Ile Pro Gln Asn Lys Leu Ile Thr Thr
20 25 30

Pro Val Lys Ala Phe Met Ile Thr Gln Asn Ile Trp Val Ile Pro Glu
35 40 45

Arg Phe Ser Ser Asp Thr Asn Pro Ser Leu Ser Lys Pro Pro Arg Pro
50 55 60

Thr Ser Lys Tyr Gln Ser Tyr Tyr Asp Pro Ser Tyr Leu Ser Thr Asp
65 70 75 80

Glu Gln Lys Asp Thr Phe Leu Lys Gly Ile Ile Lys Leu Phe Lys Arg
85 90 95

Ile Asn Glu Arg Asp Ile Gly Lys Lys Leu Ile Asn Tyr Leu Val Val
100 105 110

Gly Ser Pro Phe Met Gly Asp Ser Ser Thr Pro Glu Asp Thr Phe Asp
115 120 125

Phe Thr Arg His Thr Thr Asn Ile Ala Val Glu Lys Phe Glu Asn Gly
130 135 140

Ser Trp Lys Val Thr Asn Ile Ile Thr Pro Ser Val Leu Ile Phe Gly
145 150 155 160

Pro Leu Pro Asn Ile Leu Asp Tyr Thr Ala Ser Leu Thr Leu Gln Gly
165 170 175

Gln Gln Ser Asn Pro Ser Phe Glu Gly Phe Gly Thr Leu Ser Ile Leu
180 185 190

Lys Val Ala Pro Glu Phe Leu Leu Thr Phe Ser Asp Val Thr Ser Asn
195 200 205

Gln Ser Ser Ala Val Leu Gly Lys Ser Ile Phe Cys Met Asp Pro Val
210 215 220

Ile Ala Leu Met His Glu Leu Thr His Ser Leu His Gln Leu Tyr Gly
225 230 235 240

Ile Asn Ile Pro Ser Asp Lys Arg Ile Arg Pro Gln Val Ser Glu Gly
245 250 255

Phe Phe Ser Gln Asp Gly Pro Asn Val Gln Phe Glu Glu Leu Tyr Thr
260 265 270

Phe Gly Gly Leu Asp Val Glu Ile Ile Pro Gln Ile Glu Arg Ser Gln
275 280 285

Leu Arg Glu Lys Ala Leu Gly His Tyr Lys Asp Ile Ala Lys Arg Leu
290 295 300

Asn Asn Ile Asn Lys Thr Ile Pro Ser Ser Trp Ile Ser Asn Ile Asp
305 310 315 320

Lys Tyr Lys Lys Ile Phe Ser Glu Lys Tyr Asn Phe Asp Lys Asp Asn
325 330 335

Thr Gly Asn Phe Val Val Asn Ile Asp Lys Phe Asn Ser Leu Tyr Ser
340 345 350

Asp Leu Thr Asn Val Met Ser Glu Val Val Tyr Ser Ser Gln Tyr Asn
355 360 365

Val Lys Asn Arg Thr His Tyr Phe Ser Arg His Tyr Leu Pro Val Phe
370 375 380

Ala Asn Ile Leu Asp Asp Asn Ile Tyr Thr Ile Arg Asp Gly Phe Asn
385 390 395 400

Leu Thr Asn Lys Gly Phe Asn Ile Glu Asn Ser Gly Gln Asn Ile Glu
405 410 415

Arg Asn Pro Ala Leu Gln Lys Leu Ser Ser Glu Ser Val Val Asp Leu
420 425 430

Phe Thr Lys Val Cys Leu Arg Leu Thr Lys Asn Ser Arg Asp Asp Ser
435 440 445

Thr Cys Ile Lys Val Lys Asn Asn Arg Leu Pro Tyr Val Ala Asp Lys
450 455 460

Asp Ser Ile Ser Gln Glu Ile Phe Glu Asn Lys Ile Ile Thr Asp Glu
465 470 475 480

Thr Asn Val Gln Asn Tyr Ser Asp Asn Phe Ser Leu Asp Glu Ser Ile
485 490 495

Leu Asp Gly Gln Val Pro Ile Asn Pro Glu Ile Val Asp Pro Leu Leu
500 505 510

Pro Asn Val Asn Met Glu Pro Leu Asn Leu Pro Gly Glu Glu Ile Val
515 520 525

Phe Tyr Asp Asp Ile Thr Lys Tyr Val Asp Tyr Leu Asn Ser Tyr Tyr
530 535 540

Tyr Leu Glu Ser Gln Lys Leu Ser Asn Asn Val Glu Asn Ile Thr Leu
545 550 555 560

Thr Thr Ser Val Glu Glu Ala Leu Gly Tyr Ser Asn Lys Ile Tyr Thr
565 570 575

Phe Leu Pro Ser Leu Ala Glu Lys Val Asn Lys Gly Val Gln Ala Gly
580 585 590

Leu Phe Leu Asn Trp Ala Asn Glu Val Val Glu Asp Phe Thr Thr Asn
595 600 605

Ile Met Lys Lys Asp Thr Leu Asp Lys Ile Ser Asp Val Ser Val Ile
610 615 620

Ile Pro Tyr Ile Gly Pro Ala Leu Asn Ile Gly Asn Ser Ala Leu Arg
625 630 635 640

Gly Asn Phe Lys Gln Ala Phe Ala Thr Ala Gly Val Ala Phe Leu Leu
645 650 655

Glu Gly Phe Pro Glu Phe Thr Ile Pro Ala Leu Gly Val Phe Thr Phe
660 665 670

Tyr Ser Ser Ile Gln Glu Arg Glu Lys Ile Ile Lys Thr Ile Glu Asn
675 680 685

Cys Leu Glu Gln Arg Val Lys Arg Trp Lys Asp Ser Tyr Gln Trp Met
690 695 700

Val Ser Asn Trp Leu Ser Arg Ile Thr Thr Gln Phe Asn His Ile Asn
705 710 715 720

Tyr Gln Met Tyr Asp Ser Leu Ser Tyr Gln Ala Asp Ala Ile Lys Ala
725 730 735

Lys Ile Asp Leu Glu Tyr Lys Lys Tyr Ser Gly Ser Asp Lys Glu Asn
740 745 750

Ile Lys Ser Gln Val Glu Asn Leu Lys Asn Ser Leu Asp Val Lys Ile
755 760 765

Ser Glu Ala Met Asn Asn Ile Asn Lys Phe Ile Arg Glu Cys Ser Val
770 775 780

Thr Tyr Leu Phe Lys Asn Met Leu Pro Lys Val Ile Asp Glu Leu Asn
785 790 795 800

Lys Phe Asp Leu Arg Thr Lys Thr Glu Leu Ile Asn Leu Ile Asp Ser
805 810 815

His Asn Ile Ile Leu Val Gly Glu Val Asp Arg Leu Lys Ala Lys Val
820 825 830

Asn Glu Ser Phe Glu Asn Thr Met Pro Phe Asn Ile Phe Ser Tyr Thr
835 840 845

Asn Asn Ser Leu Leu Lys Asp Ile Ile Asn Glu Tyr Phe Asn Ser Ile
850 855 860

Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Lys Lys Asn Ala Leu Val
865 870 875 880

Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly Asp Asn Val Gln
885 890 895

Leu Asn Thr Ile Tyr Thr Asn Asp Phe Lys Leu Ser Ser Ser Gly Asp
900 905 910

Lys Ile Ile Val Asn Leu Asn Asn Asn Ile Leu Tyr Ser Ala Ile Tyr
915 920 925

Glu Asn Ser Ser Val Ser Phe Trp Ile Lys Ile Ser Lys Asp Leu Thr
930 935 940

Asn Ser His Asn Glu Tyr Thr Ile Ile Asn Ser Ile Glu Gln Asn Ser
945 950 955 960

Gly Trp Lys Leu Cys Ile Arg Asn Gly Asn Ile Glu Trp Ile Leu Gln
965 970 975

Asp Val Asn Arg Lys Tyr Lys Ser Leu Ile Phe Asp Tyr Ser Glu Ser
980 985 990

Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe Val Thr Ile Thr
995 1000 1005

Asn Asn Ile Met Gly Tyr Met Lys Leu Tyr Ile Asn Gly Glu Leu Lys
1010 1015 1020

Gln Ser Gln Lys Ile Glu Asp Leu Asp Glu Val Lys Leu Asp Lys Thr
1025 1030 1035 1040

Ile Val Phe Gly Ile Asp Glu Asn Ile Asp Glu Asn Gln Met Leu Trp
1045 1050 1055

Ile Arg Asp Phe Asn Ile Phe Ser Lys Glu Leu Ser Asn Glu Asp Ile
1060 1065 1070

Asn Ile Val Tyr Glu Gly Gln Ile Leu Arg Asn Val Ile Lys Asp Tyr
1075 1080 1085

Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr Ile Ile Asn Asp
1090 1095 1100

Asn Tyr Ile Asp Arg Tyr Ile Ala Pro Glu Ser Asn Val Leu Val Leu
1105 1110 1115 1120

Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly Asn Pro Ile Thr
1125 1130 1135

Ile Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg Ile Leu Asn Gly
1140 1145 1150

Asp Asn Ile Ile Leu His Met Leu Tyr Asn Ser Arg Lys Tyr Met Ile
1155 1160 1165

Ile Arg Asp Thr Asp Thr Ile Tyr Ala Thr Gln Gly Gly Glu Cys Ser
1170 1175 1180

Gln Asn Cys Val Tyr Ala Leu Lys Leu Gln Ser Asn Leu Gly Asn Tyr
1185 1190 1195 1200



Gly Ile Gly Ile Phe Ser Ile Lys Asn Ile Val Ser Lys Asn Lys Tyr
1205 1210 1215

Cys Ser Gln Ile Phe Ser Ser Phe Arg Glu Asn Thr Met Leu Leu Ala
1220 1225 1230

Asp Ile Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn Ala Tyr Thr Pro
1235 1240 1245

Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser Thr Ser Ser Phe
1250 1255 1260

Trp Lys Phe Ile Ser Arg Asp Pro Gly Trp Val Glu
1265 1270 1275

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1469 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 108..1460

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116
Met Gly His
1

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT TTA TTA AAA GAT ATA ATT AAT GAA TAT 212
Arg His Met Ala Ser Met Ala Leu Leu Lys Asp Ile Ile Asn Glu Tyr
20 25 30 35

TTC AAT AGT ATT AAT GAT TCA AAA ATT TTG AGC TTA CAA AAC AAA AAA 260
Phe Asn Ser Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln Asn Lys Lys
40 45 50

AAT GCT TTA GTG GAT ACA TCA GGA TAT AAT GCA GAA GTG AGG GTA GGA 308
Asn Ala Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val Arg Val Gly
55 60 65

GAT AAT GTT CAA CTT AAT ACG ATA TAT ACA AAT GAC TTT AAA TTA AGT 356
Asp Asn Val Gln Leu Asn Thr Ile Tyr Thr Asn Asp Phe Lys Leu Ser
70 75 80

AGT TCA GGA GAT AAA ATT ATA GTA AAT TTA AAT AAT AAT ATT TTA TAT 404
Ser Ser Gly Asp Lys Ile Ile Val Asn Leu Asn Asn Asn Ile Leu Tyr
85 90 95

AGC GCT ATT TAT GAG AAC TCT AGT GTT AGT TTT TGG ATT AAG ATA TCT 452
Ser Ala Ile Tyr Glu Asn Ser Ser Val Ser Phe Trp Ile Lys Ile Ser
100 105 110 115

AAA GAT TTA ACT AAT TCT CAT AAT GAA TAT ACA ATA ATT AAC AGT ATA 500
Lys Asp Leu Thr Asn Ser His Asn Glu Tyr Thr Ile Ile Asn Ser Ile
120 125 130

GAA CAA AAT TCT GGG TGG AAA TTA TGT ATT AGG AAT GGC AAT ATA GAA 548
Glu Gln Asn Ser Gly Trp Lys Leu Cys Ile Arg Asn Gly Asn Ile Glu
135 140 145

TGG ATT TTA CAA GAT GTT AAT AGA AAG TAT AAA AGT TTA ATT TTT GAT 596
Trp Ile Leu Gln Asp Val Asn Arg Lys Tyr Lys Ser Leu Ile Phe Asp
150 155 160



TAT AGT GAA TCA TTA AGT CAT ACA GGA TAT ACA AAT AAA TGG TTT TTT 644
Tyr Ser Glu Ser Leu Ser His Thr Gly Tyr Thr Asn Lys Trp Phe Phe
165 170 175

GTT ACT ATA ACT AAT AAT ATA ATG GGG TAT ATG AAA CTT TAT ATA AAT 692
Val Thr Ile Thr Asn Asn Ile Met Gly Tyr Met Lys Leu Tyr Ile Asn
180 185 190 195

GGA GAA TTA AAG CAG AGT CAA AAA ATT GAA GAT TTA GAT GAG GTT AAG 740
Gly Glu Leu Lys Gln Ser Gln Lys Ile Glu Asp Leu Asp Glu Val Lys
200 205 210

TTA GAT AAA ACC ATA GTA TTT GGA ATA GAT GAG AAT ATA GAT GAG AAT 788
Leu Asp Lys Thr Ile Val Phe Gly Ile Asp Glu Asn Ile Asp Glu Asn
215 220 225

CAG ATG CTT TGG ATT AGA GAT TTT AAT ATT TTT TCT AAA GAA TTA AGT 836
Gln Met Leu Trp Ile Arg Asp Phe Asn Ile Phe Ser Lys Glu Leu Ser
230 235 240

AAT GAA GAT ATT AAT ATT GTA TAT GAG GGA CAA ATA TTA AGA AAT GTT 884
Asn Glu Asp Ile Asn Ile Val Tyr Glu Gly Gln Ile Leu Arg Asn Val
245 250 255

ATT AAA GAT TAT TGG GGA AAT CCT TTG AAG TTT GAT ACA GAA TAT TAT 932
Ile Lys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr Glu Tyr Tyr
260 265 270 275

ATT ATT AAT GAT AAT TAT ATA GAT AGG TAT ATT GCA CCT GAA AGT AAT 980
Ile Ile Asn Asp Asn Tyr Ile Asp Arg Tyr Ile Ala Pro Glu Ser Asn
280 285 290

GTA CTT GTA CTT GTT CGG TAT CCA GAT AGA TCT AAA TTA TAT ACT GGA 1028
Val Leu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu Tyr Thr Gly
295 300 305

AAT CCT ATT ACT ATT AAA TCA GTA TCT GAT AAG AAT CCT TAT AGT AGA 1076
Asn Pro Ile Thr Ile Lys Ser Val Ser Asp Lys Asn Pro Tyr Ser Arg
310 315 320

ATT TTA AAT GGA GAT AAT ATA ATT CTT CAT ATG TTA TAT AAT AGT AGG 1124
Ile Leu Asn Gly Asp Asn Ile Ile Leu His Met Leu Tyr Asn Ser Arg
325 330 335

AAA TAT ATG ATA ATA AGA GAT ACT GAT ACA ATA TAT GCA ACA CAA GGA 1172
Lys Tyr Met Ile Ile Arg Asp Thr Asp Thr Ile Tyr Ala Thr Gln Gly
340 345 350 355

GGA GAG TGT TCA CAA AAT TGT GTA TAT GCA TTA AAA TTA CAG AGT AAT 1220
Gly Glu Cys Ser Gln Asn Cys Val Tyr Ala Leu Lys Leu Gln Ser Asn
360 365 370

TTA GGT AAT TAT GGT ATA GGT ATA TTT AGT ATA AAA AAT ATT GTA TCT 1268
Leu Gly Asn Tyr Gly Ile Gly Ile Phe Ser Ile Lys Asn Ile Val Ser
375 380 385

AAA AAT AAA TAT TGT AGT CAA ATT TTC TCT AGT TTT AGG GAA AAT ACA 1316
Lys Asn Lys Tyr Cys Ser Gln Ile Phe Ser Ser Phe Arg Glu Asn Thr
390 395 400

ATG CTT CTA GCA GAT ATA TAT AAA CCT TGG AGA TTT TCT TTT AAA AAT 1364
Met Leu Leu Ala Asp Ile Tyr Lys Pro Trp Arg Phe Ser Phe Lys Asn
405 410 415

GCA TAC ACG CCA GTT GCA GTA ACT AAT TAT GAA ACA AAA CTA TTA TCA 1412
Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lys Leu Leu Ser
420 425 430 435

ACT TCA TCT TTT TGG AAA TTT ATT TCT AGG GAT CCA GGA TGG GTA GAG 1460
Thr Ser Ser Phe Trp Lys Phe Ile Ser Arg Asp Pro Gly Trp Val Glu
440 445 450

TAAAAGCTT 1469

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 451 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Leu Leu Lys Asp Ile Ile
20 25 30

Asn Glu Tyr Phe Asn Ser Ile Asn Asp Ser Lys Ile Leu Ser Leu Gln
35 40 45

Asn Lys Lys Asn Ala Leu Val Asp Thr Ser Gly Tyr Asn Ala Glu Val
50 55 60

Arg Val Gly Asp Asn Val Gln Leu Asn Thr Ile Tyr Thr Asn Asp Phe
65 70 75 80

Lys Leu Ser Ser Ser Gly Asp Lys Ile Ile Val Asn Leu Asn Asn Asn
85 90 95

Ile Leu Tyr Ser Ala Ile Tyr Glu Asn Ser Ser Val Ser Phe Trp Ile
100 105 110

Lys Ile Ser Lys Asp Leu Thr Asn Ser His Asn Glu Tyr Thr Ile Ile
115 120 125

Asn Ser Ile Glu Gln Asn Ser Gly Trp Lys Leu Cys Ile Arg Asn Gly
130 135 140

Asn Ile Glu Trp Ile Leu Gln Asp Val Asn Arg Lys Tyr Lys Ser Leu
145 150 155 160

Ile Phe Asp Tyr Ser Glu Ser Leu Ser His Thr Gly Tyr Thr Asn Lys
165 170 175

Trp Phe Phe Val Thr Ile Thr Asn Asn Ile Met Gly Tyr Met Lys Leu
180 185 190

Tyr Ile Asn Gly Glu Leu Lys Gln Ser Gln Lys Ile Glu Asp Leu Asp
195 200 205

Glu Val Lys Leu Asp Lys Thr Ile Val Phe Gly Ile Asp Glu Asn Ile
210 215 220

Asp Glu Asn Gln Met Leu Trp Ile Arg Asp Phe Asn Ile Phe Ser Lys
225 230 235 240

Glu Leu Ser Asn Glu Asp Ile Asn Ile Val Tyr Glu Gly Gln Ile Leu
245 250 255

Arg Asn Val Ile Lys Asp Tyr Trp Gly Asn Pro Leu Lys Phe Asp Thr
260 265 270

Glu Tyr Tyr Ile Ile Asn Asp Asn Tyr Ile Asp Arg Tyr Ile Ala Pro
275 280 285

Glu Ser Asn Val Leu Val Leu Val Arg Tyr Pro Asp Arg Ser Lys Leu
290 295 300

Tyr Thr Gly Asn Pro Ile Thr Ile Lys Ser Val Ser Asp Lys Asn Pro
305 310 315 320



Tyr Ser Arg Ile Leu Asn Gly Asp Asn Ile Ile Leu His Met Leu Tyr
325 330 335

Asn Ser Arg Lys Tyr Met Ile Ile Arg Asp Thr Asp Thr Ile Tyr Ala
340 345 350

Thr Gln Gly Gly Glu Cys Ser Gln Asn Cys Val Tyr Ala Leu Lys Leu
355 360 365

Gln Ser Asn Leu Gly Asn Tyr Gly Ile Gly Ile Phe Ser Ile Lys Asn
370 375 380

Ile Val Ser Lys Asn Lys Tyr Cys Ser Gln Ile Phe Ser Ser Phe Arg
385 390 395 400

Glu Asn Thr Met Leu Leu Ala Asp Ile Tyr Lys Pro Trp Arg Phe Ser
405 410 415

Phe Lys Asn Ala Tyr Thr Pro Val Ala Val Thr Asn Tyr Glu Thr Lys
420 425 430

Leu Leu Ser Thr Ser Ser Phe Trp Lys Phe Ile Ser Arg Asp Pro Gly
435 440 445

Trp Val Glu
450

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

GCAAGCTTTT ACTCTACCCA TCCTGGATCC CT 32

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3825 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..3822

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

ATG CCA GTT GCA ATA AAT AGT TTT AAT TAT AAT GAC CCT GTT AAT GAT 48
Met Pro Val Ala Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp
1 5 10 15

GAT ACA ATT TTA TAG ATG CAG ATA CCA TAT GAA GAA AAA AGT AAA AAA 96
Asp Thr Ile Leu Tyr Met Gln Ile Pro Tyr Glu Glu Lys Ser Lys Lys
20 25 30

TAT TAT AAA GCT TTT GAG ATT ATG CGT AAT GTT TGG ATA ATT CCT GAG 144
Tyr Tyr Lys Ala Phe Glu Ile Met Arg Asn Val Trp Ile Ile Pro Glu
35 40 45

AGA AAT ACA ATA GGA ACG AAT CCT AGT GAT TTT GAT CCA CCG GCT TCA 192
Arg Asn Thr Ile Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser
50 55 60

340 345 350

GAG AGT GAC TTA GCA AAT AAA TTT AAA GTA AAA TGT AGA AAT ACT TAT 1104
Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tyr
355 360 365

TTT ATT AAA TAT GAA TTT TTA AAA GTT CCA AAT TTG TTA GAT GAT GAT 1152
Phe Ile Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp
370 375 380

ATT TAT ACT GTA TCA GAG GGG TTT AAT ATA GGT AAT TTA GCA GTA AAC 1200
Ile Tyr Thr Val Ser Glu Gly Phe Asn Ile Gly Asn Leu Ala Val Asn
385 390 395 400

AAT CGC GGA CAA AGT ATA AAG TTA AAT CCT AAA ATT ATT GAT TCC ATT 1248
Asn Arg Gly Gln Ser Ile Lys Leu Asn Pro Lys Ile Ile Asp Ser Ile
405 410 415

CCA GAT AAA GGT CTA GTA GAA AAG ATC GTT AAA TTT TGT AAG AGC GTT 1296
Pro Asp Lys Gly Leu Val Glu Lys Ile Val Lys Phe Cys Lys Ser Val
420 425 430

ATT CCT AGA AAA GGT ACA AAG GCG CCA CCG CGA CTA TGC ATT AGA GTA 1344
Ile Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys Ile Arg Val
435 440 445

AAT AAT AGT GAG TTA TTT TTT GTA GCT TCA GAA AGT AGC TAT AAT GAA 1392
Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu
450 455 460

AAT GAT ATT AAT ACA CCT AAA GAA ATT GAC GAT ACA ACA AAT CTA AAT 1440
Asn Asp Ile Asn Thr Pro Lys Glu Ile Asp Asp Thr Thr Asn Leu Asn
465 470 475 480

AAT AAT TAT AGA AAT AAT TTA GAT GAA GTT ATT TTA GAT TAT AAT AGT 1488
Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val Ile Leu Asp Tyr Asn Ser
485 490 495

CAG ACA ATA CCT CAA ATA TCA AAT CGA ACA TTA AAT ACA CTT GTA CAA 1536
Gln Thr Ile Pro Gln Ile Ser Asn Arg Thr Leu Asn Thr Leu Val Gln
500 505 510

GAC AAT AGT TAT GTG CCA AGA TAT GAT TCT AAT GGA ACA AGT GAA ATA 1584
Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu Ile
515 520 525

GAG GAA TAT GAT GTT GTT GAC TTT AAT GTA TTT TTC TAT TTA CAT GCA 1632
Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala
530 535 540

CAA AAA GTG CCA GAA GGT GAA ACC AAT ATA AGT TTA ACT TCT TCA ATT 1680
Gln Lys Val Pro Glu Gly Glu Thr Asn Ile Ser Leu Thr Ser Ser Ile
545 550 555 560

GAT ACA GCA TTA TTA GAA GAA TCC AAA GAT ATA TTT TTT TCT TCA GAG 1728

Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp Ile Phe Phe Ser Ser Glu
565 570 575

TTT ATC GAT ACT ATC AAT AAA CCT GTA AAT GCA GCA CTA TTT ATA GAT 1776
Phe Ile Asp Thr Ile Asn Lys Pro Val Asn Ala Ala Leu Phe Ile Asp
580 585 590

TGG ATA AGC AAA GTA ATA AGA GAT TTT ACC ACT GAA GCT ACA CAA AAA 1824
Trp Ile Ser Lys Val Ile Arg Asp Phe Thr Thr Glu Ala Thr Gln Lys
595 600 605

AGT ACT GTT GAT AAG ATT GCA GAC ATA TCT TTA ATT GTA CCC TAT GTA 1872
Ser Thr Val Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Val
610 615 620

GGT CTT GCT TTG AAT ATA ATT ATT GAG GCA GAA AAA GGA AAT TTT GAG 1920
Gly Leu Ala Leu Asn Ile Ile Ile Glu Ala Glu Lys Gly Asn Phe Glu
625 630 635 640

GAG GCA TTT GAA TTA TTA GGA GTG GGT ATT TTA TTA GAA TTT GTG CCA 1968
Glu Ala Phe Glu Leu Leu Gly Val Gly Ile Leu Leu Glu Phe Val Pro
645 650 655

GAA CTT ACA ATT CCT GTA ATT TTA GTG TTT ACG ATA AAA TCC TAT ATA 2016
Glu Leu Thr Ile Pro Val Ile Leu Val Phe Thr Ile Lys Ser Tyr Ile
660 665 670

GAT TCA TAT GAG AAT AAA AAT AAA GCA ATT AAA GCA ATA AAT AAT TCA 2064
Asp Ser Tyr Glu Asn Lys Asn Lys Ala Ile Lys Ala Ile Asn Asn Ser
675 680 685

TTA ATC GAA AGA GAA GCA AAG TGG AAA GAA ATA TAT AGT TGG ATA GTA 2112
Leu Ile Glu Arg Glu Ala Lys Trp Lys Glu Ile Tyr Ser Trp Ile Val
690 695 700

TCA AAT TGG CTT ACT AGA ATT AAT ACT CAA TTT AAT AAA AGA AAA GAG 2160
Ser Asn Trp Leu Thr Arg Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu
705 710 715 720

CAA ATG TAT CAG GCT TTA CAA AAT CAA GTA GAT GCA ATA AAA ACA GCA 2208
Gln Met Tyr Gln Ala Leu Gln Asn Gln Val Asp Ala Ile Lys Thr Ala
725 730 735

ATA GAA TAT AAA TAT AAT AAT TAT ACT TCA GAT GAG AAA AAT AGA CTT 2256
Ile Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu
740 745 750

GAA TCT GAA TAT AAT ATC AAT AAT ATA GAA GAA GAA TTG AAT AAA AAA 2304
Glu Ser Glu Tyr Asn Ile Asn Asn Ile Glu Glu Glu Leu Asn Lys Lys
755 760 765

GTT TCT TTA GCA ATG AAA AAT ATA GAA AGA TTT ATG ACA GAA AGT TCT 2352
Val Ser Leu Ala Met Lys Asn Ile Glu Arg Phe Met Thr Glu Ser Ser
770 775 780

ATA TCT TAT TTA ATG AAA TTA ATA AAT GAA GCC AAA GTT GGT AAA TTA 2400
Ile Ser Tyr Leu Met Lys Leu Ile Asn Glu Ala Lys Val Gly Lys Leu
785 790 795 800

AAA AAA TAT GAT AAC CAT GTT AAG AGC GAT TTA TTA AAC TAT ATT CTC 2448
Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr Ile Leu
805 810 815

GAC CAT AGA TCA ATC TTA GGA GAG CAG ACA AAT GAA TTA AGT GAT TTG 2496
Asp His Arg Ser Ile Leu Gly Glu Gln Thr Asn Glu Leu Ser Asp Leu
820 825 830

GTG ACT AGT ACT TTG AAT AGT AGT ATT CCA TTT GAA CTT TCT TCA TAT 2544
Val Thr Ser Thr Leu Asn Ser Ser Ile Pro Phe Glu Leu Ser Ser Tyr
835 840 845

ACT AAT GAT AAA ATT CTA ATT ATA TAT TTT AAT AGA TTA TAT AAA AAA 2592
Thr Asn Asp Lys Ile Leu Ile Ile Tyr Phe Asn Arg Leu Tyr Lys Lys
850 855 860

ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT AAT AAA TTT 2640
Ile Lys Asp Ser Ser Ile Leu Asp Met Arg Tyr Glu Asn Asn Lys Phe
865 870 875 880

1155 1160 1165

AAA AAC GAT CTA GCA TAC ATT AAT GTA GTA GAT CGT GGT GTA GAA TAT 3552
Lys Asn Asp Leu Ala Tyr Ile Asn Val Val Asp Arg Gly Val Glu Tyr
1170 1175 1180

CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA ATA AGA ACA 3600
Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys Ile Ile Arg Thr
1185 1190 1195 1200

TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG GAT TCA ATA 3648
Ser Asn Leu Asn Asp Ser Leu Gly Gln Ile Ile Val Met Asp Ser Ile
1205 1210 1215

GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG AGC AAT ATA 3696
Gly Asn Asn Cys Thr Met Asn Phe Gln Asn Asn Asn Gly Ser Asn Ile
1220 1225 1230

GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT AGT TGG TAT 3744
Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser Ser Trp Tyr
1235 1240 1245

TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC TTT TGG AGT 3792
Tyr Asn Asn Ile Arg Arg Asn Thr Ser Ser Asn Gly Cys Phe Trp Ser
1250 1255 1260

TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGA 3825
Ser Ile Ser Lys Glu Asn Gly Trp Lys Glu
1265 1270



(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1274 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Met Pro Val Ala Ile Asn Ser Phe Asn Tyr Asn Asp Pro Val Asn Asp
1 5 10 15

Asp Thr Ile Leu Tyr Met Gln Ile Pro Tyr Glu Glu Lys Ser Lys Lys
20 25 30

Tyr Tyr Lys Ala Phe Glu Ile Met Arg Asn Val Trp Ile Ile Pro Glu
35 40 45

Arg Asn Thr Ile Gly Thr Asn Pro Ser Asp Phe Asp Pro Pro Ala Ser
50 55 60

Leu Lys Asn Gly Ser Ser Ala Tyr Tyr Asp Pro Asn Tyr Leu Thr Thr
65 70 75 80

Asp Ala Glu Lys Asp Arg Tyr Leu Lys Thr Thr Ile Lys Leu Phe Lys
85 90 95

Arg Ile Asn Ser Asn Pro Ala Gly Lys Val Leu Leu Gln Glu Ile Ser
100 105 110

Tyr Ala Lys Pro Tyr Leu Gly Asn Asp His Thr Pro Ile Asp Glu Phe
115 120 125

Ser Pro Val Thr Arg Thr Thr Ser Val Asn Ile Lys Leu Ser Thr Asn
130 135 140

Val Glu Ser Ser Met Leu Leu Asn Leu Leu Val Leu Gly Ala Gly Pro
145 150 155 160

Asp Ile Phe Glu Ser Cys Cys Tyr Pro Val Arg Lys Leu Ile Asp Pro
165 170 175

Asp Val Val Tyr Asp Pro Ser Asn Tyr Gly Phe Gly Ser Ile Asn Ile
180 185 190

Val Thr Phe Ser Pro Glu Tyr Glu Tyr Thr Phe Asn Asp Ile Ser Gly
195 200 205

Gly His Asn Ser Ser Thr Glu Ser Phe Ile Ala Asp Pro Ala Ile Ser
210 215 220

Leu Ala His Glu Leu Ile His Ala Leu His Gly Leu Tyr Gly Ala Arg
225 230 235 240

Gly Val Thr Tyr Glu Glu Thr Ile Glu Val Lys Gln Ala Pro Leu Met
245 250 255

Ile Ala Glu Lys Pro Ile Arg Leu Glu Glu Phe Leu Thr Phe Gly Gly
260 265 270

Gln Asp Leu Asn Ile Ile Thr Ser Ala Met Lys Glu Lys Ile Tyr Asn
275 280 285

Asn Leu Leu Ala Asn Tyr Glu Lys Ile Ala Thr Arg Leu Ser Glu Val
290 295 300

Asn Ser Ala Pro Pro Glu Tyr Asp Ile Asn Glu Tyr Lys Asp Tyr Phe
305 310 315 320

Gln Trp Lys Tyr Gly Leu Asp Lys Asn Ala Asp Gly Ser Tyr Thr Val
325 330 335

Asn Glu Asn Lys Phe Asn Glu Ile Tyr Lys Lys Leu Tyr Ser Phe Thr
340 345 350

Glu Ser Asp Leu Ala Asn Lys Phe Lys Val Lys Cys Arg Asn Thr Tyr
355 360 365

Phe Ile Lys Tyr Glu Phe Leu Lys Val Pro Asn Leu Leu Asp Asp Asp
370 375 380

Ile Tyr Thr Val Ser Glu Gly Phe Asn Ile Gly Asn Leu Ala Val Asn
385 390 395 400

Asn Arg Gly Gln Ser Ile Lys Leu Asn Pro Lys Ile Ile Asp Ser Ile
405 410 415

Pro Asp Lys Gly Leu Val Glu Lys Ile Val Lys Phe Cys Lys Ser Val
420 425 430

Ile Pro Arg Lys Gly Thr Lys Ala Pro Pro Arg Leu Cys Ile Arg Val
435 440 445

Asn Asn Ser Glu Leu Phe Phe Val Ala Ser Glu Ser Ser Tyr Asn Glu
450 455 460

Asn Asp Ile Asn Thr Pro Lys Glu Ile Asp Asp Thr Thr Asn Leu Asn
465 470 475 480

Asn Asn Tyr Arg Asn Asn Leu Asp Glu Val Ile Leu Asp Tyr Asn Ser
485 490 495

Gln Thr Ile Pro Gln Ile Ser Asn Arg Thr Leu Asn Thr Leu Val Gln
500 505 510

Asp Asn Ser Tyr Val Pro Arg Tyr Asp Ser Asn Gly Thr Ser Glu Ile
515 520 525

Glu Glu Tyr Asp Val Val Asp Phe Asn Val Phe Phe Tyr Leu His Ala
530 535 540

Gln Lys Val Pro Glu Gly Glu Thr Asn Ile Ser Leu Thr Ser Ser Ile
545 550 555 560

Asp Thr Ala Leu Leu Glu Glu Ser Lys Asp Ile Phe Phe Ser Ser Glu
565 570 575

Phe Ile Asp Thr Ile Asn Lys Pro Val Asn Ala Ala Leu Phe Ile Asp
580 585 590

Trp Ile Ser Lys Val Ile Arg Asp Phe Thr Thr Glu Ala Thr Gln Lys
595 600 605

Ser Thr Val Asp Lys Ile Ala Asp Ile Ser Leu Ile Val Pro Tyr Val
610 615 620

Gly Leu Ala Leu Asn Ile Ile Ile Glu Ala Glu Lys Gly Asn Phe Glu
625 630 635 640

Glu Ala Phe Glu Leu Leu Gly Val Gly Ile Leu Leu Glu Phe Val Pro
645 650 655

Glu Leu Thr Ile Pro Val Ile Leu Val Phe Thr Ile Lys Ser Tyr Ile
660 665 670

Asp Ser Tyr Glu Asn Lys Asn Lys Ala Ile Lys Ala Ile Asn Asn Ser
675 680 685

Leu Ile Glu Arg Glu Ala Lys Trp Lys Glu Ile Tyr Ser Trp Ile Val
690 695 700

Ser Asn Trp Leu Thr Arg Ile Asn Thr Gln Phe Asn Lys Arg Lys Glu
705 710 715 720

Gln Met Tyr Gln Ala Leu Gln Asn Gln Val Asp Ala Ile Lys Thr Ala
725 730 735

Ile Glu Tyr Lys Tyr Asn Asn Tyr Thr Ser Asp Glu Lys Asn Arg Leu
740 745 750

Glu Ser Glu Tyr Asn Ile Asn Asn Ile Glu Glu Glu Leu Asn Lys Lys
755 760 765

Val Ser Leu Ala Met Lys Asn Ile Glu Arg Phe Met Thr Glu Ser Ser
770 775 780

Ile Ser Tyr Leu Met Lys Leu Ile Asn Glu Ala Lys Val Gly Lys Leu
785 790 795 800

Lys Lys Tyr Asp Asn His Val Lys Ser Asp Leu Leu Asn Tyr Ile Leu
805 810 815

Asp His Arg Ser Ile Leu Gly Glu Gln Thr Asn Glu Leu Ser Asp Leu
820 825 830

Val Thr Ser Thr Leu Asn Ser Ser Ile Pro Phe Glu Leu Ser Ser Tyr
835 840 845

Thr Asn Asp Lys Ile Leu Ile Ile Tyr Phe Asn Arg Leu Tyr Lys Lys
850 855 860

Ile Lys Asp Ser Ser Ile Leu Asp Met Arg Tyr Glu Asn Asn Lys Phe
865 870 875 880

Ile Asp Ile Ser Gly Tyr Gly Ser Asn Ile Ser Ile Asn Gly Asn Val
885 890 895

Tyr Ile Tyr Ser Thr Asn Arg Asn Gln Phe Gly Ile Tyr Asn Ser Arg
900 905 910

Leu Ser Glu Val Asn Ile Ala Gln Asn Asn Asp Ile Ile Tyr Asn Ser
915 920 925

Arg Tyr Gln Asn Phe Ser Ile Ser Phe Trp Val Arg Ile Pro Lys His
930 935 940

Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr Ile Ile Asn Cys Met
945 950 955 960

Gly Asn Asn Asn Ser Gly Trp Lys Ile Ser Leu Arg Thr Val Arg Asp
965 970 975

Cys Glu Ile Ile Trp Thr Leu Gln Asp Thr Ser Gly Asn Lys Glu Asn
980 985 990

Leu Ile Phe Arg Tyr Glu Glu Leu Asn Arg Ile Ser Asn Tyr Ile Asn
995 1000 1005

Lys Trp Ile Phe Val Thr Ile Thr Asn Asn Arg Leu Gly Asn Ser Arg
1010 1015 1020

Ile Tyr Ile Asn Gly Asn Leu Ile Val Glu Lys Ser Ile Ser Asn Leu
1025 1030 1035 1040

Gly Asp Ile His Val Ser Asp Asn Ile Leu Phe Lys Ile Val Gly Cys
1045 1050 1055

Asp Asp Glu Thr Tyr Val Gly Ile Arg Tyr Phe Lys Val Phe Asn Thr
1060 1065 1070

Glu Leu Asp Lys Thr Glu Ile Glu Thr Leu Tyr Ser Asn Glu Pro Asp
1075 1080 1085

Pro Ser Ile Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu Tyr Asn Lys
1090 1095 1100

Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr Ile Thr Leu
1105 1110 1115 1120

Asn Ser Gly Ile Leu Asn Ile Asn Gln Gln Arg Gly Val Thr Glu Gly
1125 1130 1135

Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu Val Ile Ile
1140 1145 1150

Arg Lys Asn Gly Pro Ile Asp Ile Ser Asn Thr Asp Asn Phe Val Arg
1155 1160 1165

Lys Asn Asp Leu Ala Tyr Ile Asn Val Val Asp Arg Gly Val Glu Tyr
1170 1175 1180

Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys Ile Ile Arg Thr
1185 1190 1195 1200

Ser Asn Leu Asn Asp Ser Leu Gly Gln Ile Ile Val Met Asp Ser Ile
1205 1210 1215

Gly Asn Asn Cys Thr Met Asn Phe Gln Asn Asn Asn Gly Ser Asn Ile
1220 1225 1230

Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser Ser Trp Tyr
1235 1240 1245

Tyr Asn Asn Ile Arg Arg Asn Thr Ser Ser Asn Gly Cys Phe Trp Ser
1250 1255 1260

Ser Ile Ser Lys Glu Asn Gly Trp Lys Glu
1265 1270

(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1460 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 108..1451

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116
Met Gly His
1

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT ATT CTA ATT ATA TAT TTT AAT AGA TTA 212
Arg His Met Ala Ser Met Ala Ile Leu Ile Ile Tyr Phe Asn Arg Leu
20 25 30 35

TAT AAA AAA ATT AAA GAT AGT TCT ATT TTA GAT ATG CGA TAT GAA AAT 260
Tyr Lys Lys Ile Lys Asp Ser Ser Ile Leu Asp Met Arg Tyr Glu Asn
40 45 50

AAT AAA TTT ATA GAT ATC TCT GGA TAT GGT TCA AAT ATA AGC ATT AAT 308
Asn Lys Phe Ile Asp Ile Ser Gly Tyr Gly Ser Asn Ile Ser Ile Asn
55 60 65

GGA AAC GTA TAT ATT TAT TCA ACA AAT AGA AAT CAA TTT GGA ATA TAT 356
Gly Asn Val Tyr Ile Tyr Ser Thr Asn Arg Asn Gln Phe Gly Ile Tyr
70 75 80

AAT AGT AGG CTT AGT GAA GTT AAT ATA GCT CAA AAT AAT GAT ATT ATA 404
Asn Ser Arg Leu Ser Glu Val Asn Ile Ala Gln Asn Asn Asp Ile Ile
85 90 95

TAC AAT AGT AGA TAT CAA AAT TTT AGT ATT AGT TTC TGG GTA AGG ATT 452
Tyr Asn Ser Arg Tyr Gln Asn Phe Ser Ile Ser Phe Trp Val Arg Ile
100 105 110 115

CCT AAA CAC TAC AAA CCT ATG AAT CAT AAT CGG GAA TAC ACT ATA ATA 500
Pro Lys His Tyr Lys Pro Met Asn His Asn Arg Glu Tyr Thr Ile Ile
120 125 130

AAT TGT ATG GGG AAT AAT AAT TCG GGA TGG AAA ATA TCA CTT AGA ACT 54 8 

Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lys He Ser Leu Arg Thr 
55 135 140 145 

GTT AGA GAT TGT GAA ATA ATT TGG ACT TTA CAA GAT ACT TCT GGA AAT 5 96 

Val Arg Asp Cys Glu He He Trp Thr Leu Gin Asp Thr Ser Gly Asn 
150 155 160 

60 

AAG GAA AAT TTA ATT TTT AGG TAT GAA GAA CTT AAT AGG ATA TCT AAT 644 
Lvs Glu Asn Leu He Phe Arg Tyr Glu Glu Leu Asn Arg He Ser Asn 
165 170 175 

65 TAT ATA AAT AAA TGG ATT TTT GTA ACT ATT ACT AAT AAT AGA TTA GGC 6 92 

Tyr He Asn Lys Trp He Phe Val Thr He Thr Asn Asn Arg Leu Gly 
IBO 185 190 195 

AAT TCT AGA ATT TAC ATC AAT GGA AAT TTA ATA GTT G7VA AAA TCA ATT 74 0 

70 Asn Ser Arg He Tyr He Asn Gly Asn Leu He Val Glu Lys Ser He 
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200 205 210 

TCG AAT TTA GGT GAT ATT CAT GTT AGT GAT AAT ATA TTA TTT AAA ATT 788 
Ser Asn Leu Gly Asp lie His Val Ser Asp Asn lle Leu Phe Lys He 
215 220 225 

GTT GGT TGT GAT GAT GAA ACG TAT GTT GGT ATA AGA TAT TTT AAA GTT 836 
Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tyr Phe Lys Val 
230 235 240 

TTT AAT ACG GAA TTA GAT AAA ACA GAA ATT GAG ACT TTA TAT AGT AAT 8 84 

Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu Tyr Ser Asn 
245 250 255 

GAG CCA GAT CCA AGT ATC TTA AAA AAC TAT TGG GGA AAT TAT TTG CTA 932 
Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn Tyr Leu Leu 
260 265 270 275 

TAT AAT AAA AAA TAT TAT TTA TTC AAT TTA CTA AGA AAA GAT AAG TAT 980 
Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys Asp Lys Tyr 
280 285 290 

ATT ACT CTG AAT TCA GGC ATT TTA AAT ATT AAT CAA CAA AGA GGT GTT 1028 
He Thr Leu Asn Ser Gly He Leu Asn He Asn Gin Gin Arg Gly Val 
295 300 305 

ACT GAA GGC TCT GTT TTT TTG AAC TAT AAA TTA TAT GAA GGA GTA GAA 107 6 

Thr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu Gly Val Glu 
310 315 320 

GTC ATT ATA AGA AAA AAT GGT CCT ATA GAT ATA TCT AAT ACA GAT AAT 112 4 

Val He He Arg Lys Asn Gly Pro He Asp He Ser Asn Thr Asp Asn 
325 330 335 

35 TTT GTT AGA AAA AAC GAT CTA GCA TAC ATT AAT GTA GTA GAT CGT GGT 1172 

Phe Val Arg Lys Asn Asp Leu Ala Tyr He Asn Val Val Asp Arg Gly 
340 345 350 355 

GTA GAA TAT CGG TTA TAT GCT GAT ACA AAA TCA GAG AAA GAG AAA ATA 1220 
40 Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys Glu Lys He 

360 365 370 

ATA AGA ACA TCT AAT CTA AAC GAT AGC TTA GGT CAA ATT ATA GTT ATG 126 8 

He Arg Thr Ser Asn Leu Asn Asp Ser Leu Gly Gin He He Val Met 
375 380 385 

GAT TCA ATA GGA AAT AAT TGC ACA ATG AAT TTT CAA AAC AAT AAT GGG 1316 
Asp Ser He Gly Asn Asn Cys Thr Met Asn Phe Gin Asn Asn Asn Gly 
390 395 400 

AGC AAT ATA GGA TTA CTA GGT TTT CAT TCA AAT AAT TTG GTT GCT AGT 1364 
Ser Asn He Gly Leu Leu Gly Phe His Ser Asn Asn Leu Val Ala Ser 
405 410 415 

AGT TGG TAT TAT AAC AAT ATA CGA AGA AAT ACT AGC AGT AAT GGA TGC 1412
Ser Trp Tyr Tyr Asn Asn Ile Arg Arg Asn Thr Ser Ser Asn Gly Cys
420 425 430 435

TTT TGG AGT TCT ATT TCT AAA GAG AAT GGA TGG AAA GAA TGAAAGCTT 1460
Phe Trp Ser Ser Ile Ser Lys Glu Asn Gly Trp Lys Glu
440 445

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 448 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Ile Leu Ile Ile Tyr Phe
20 25 30

Asn Arg Leu Tyr Lys Lys Ile Lys Asp Ser Ser Ile Leu Asp Met Arg
35 40 45

Tyr Glu Asn Asn Lys Phe Ile Asp Ile Ser Gly Tyr Gly Ser Asn Ile
50 55 60

Ser Ile Asn Gly Asn Val Tyr Ile Tyr Ser Thr Asn Arg Asn Gln Phe
65 70 75 80

Gly Ile Tyr Asn Ser Arg Leu Ser Glu Val Asn Ile Ala Gln Asn Asn
85 90 95

Asp Ile Ile Tyr Asn Ser Arg Tyr Gln Asn Phe Ser Ile Ser Phe Trp
100 105 110

Val Arg Ile Pro Lys His Tyr Lys Pro Met Asn His Asn Arg Glu Tyr
115 120 125

Thr Ile Ile Asn Cys Met Gly Asn Asn Asn Ser Gly Trp Lys Ile Ser
130 135 140

Leu Arg Thr Val Arg Asp Cys Glu Ile Ile Trp Thr Leu Gln Asp Thr
145 150 155 160

Ser Gly Asn Lys Glu Asn Leu Ile Phe Arg Tyr Glu Glu Leu Asn Arg
165 170 175

Ile Ser Asn Tyr Ile Asn Lys Trp Ile Phe Val Thr Ile Thr Asn Asn
180 185 190

Arg Leu Gly Asn Ser Arg He Tyr He Asn Gly Asn Leu He Val Glu 
195 200 205 

Lys Ser He Ser Asn Leu Gly Asp He His Val Ser Asp Asn He Leu 
210 215 220 

Phe Lys He Val Gly Cys Asp Asp Glu Thr Tyr Val Gly He Arg Tyr 
225 230 235 240

Phe Lys Val Phe Asn Thr Glu Leu Asp Lys Thr Glu He Glu Thr Leu 
245 250 255 

Tyr Ser Asn Glu Pro Asp Pro Ser He Leu Lys Asn Tyr Trp Gly Asn 
260 265 270 

Tyr Leu Leu Tyr Asn Lys Lys Tyr Tyr Leu Phe Asn Leu Leu Arg Lys 
275 280 285 

Asp Lys Tyr He Thr Leu Asn Ser Gly He Leu Asn He Asn Gin Gin 

295 300 

Arg Gly Val Thr Glu Gly Ser Val Phe Leu Asn Tyr Lys Leu Tyr Glu 

315 320 

Gly Val Glu Val Ile Ile Arg Lys Asn Gly Pro Ile Asp Ile Ser Asn
325 330 335

Thr Asp Asn Phe Val Arg Lys Asn Asp Leu Ala Tyr He Asn Val Val 
340 345 350 

Asp Arg Gly Val Glu Tyr Arg Leu Tyr Ala Asp Thr Lys Ser Glu Lys
355 360 365

Tyr Tyr Lys Ala Phe Arg Ile Ile Asp Arg Ile Trp Ile Val Pro Glu
35 40 45 

AGG TTT ACT TAT GGA TTT CAA CCT GAC CAA TTT AAT GCC AGT ACA GGA 192
Arg Phe Thr Tyr Gly Phe Gln Pro Asp Gln Phe Asn Ala Ser Thr Gly
50 55 60

GTT TTT AGT AAA GAT GTC TAC GAA TAT TAC GAT CCA ACT TAT TTA AAA 240
Val Phe Ser Lys Asp Val Tyr Glu Tyr Tyr Asp Pro Thr Tyr Leu Lys
65 70 75 80

ACC GAT GCT GAA AAA GAT AAA TTT TTA AAA ACA ATG ATT AAA TTA TTT 288
Thr Asp Ala Glu Lys Asp Lys Phe Leu Lys Thr Met Ile Lys Leu Phe
85 90 95

AAT AGA ATT AAT TCA AAA CCA TCA GGA CAG AGA TTA CTG GAT ATG ATA 336 

Asn Arg He Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met He 

100 105 110 

GTA GAT GCT ATA CCT TAT CTT GGA AAT GCA TCT ACA CCG CCC GAC AAA 384
Val Asp Ala Ile Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lys
115 120 125

TTT GGA GCA AAT GTT GCA AAT GTA TCT ATT AAT AAA AAA ATT ATC CAA 4 32 

25 Phe Ala Ala Asn Val Ala Asn Val Ser He Asn Lys Lys He He Gin 

130 135 140 

CCT GGA GCT GAA GAT CAA ATA AAA GGT TTA ATG ACA AAT TTA ATA ATA 480 
Pro Gly Ala Glu Asp Gin He Lys Gly Leu Met Thr Asn Leu He He 
30 145 150 155 160 

TTT GGA CCA GGA CCA GTT CTA AGT GAT AAT TTT ACT GAT AGT ATG ATT 52 8 

Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
165 170 175 

35 

ATG AAT GGC CAT TCC CCA ATA TCA GAA GGA TTT GGT GCA AGA ATG ATG 576 
Met Asn Gly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

ATA AGA TTT TGT CCT AGT TGT TTA AAT GTA TTT AAT AAT GTT CAG GAA 624
Ile Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gln Glu
195 200 205

AAT AAA GAT ACA TCT ATA TTT AGT AGA CGC GCG TAT TTT GCA GAT CCA 6 72 

45 Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro 

210 215 220 

GCT CTA ACG TTA ATG CAT GAA CTT ATA CAT GTG TTA CAT GGA TTA TAT 720 
Ala Leu Thr Leu Met His Glu Leu He His Val Leu His Gly Leu Tyr 
50 225 230 235 240 

GGA ATT AAG ATA AGT AAT TTA CCA ATT ACT CCA AAT ACA AAA GAA TTT 768 
Gly He Lys He Ser Asn Leu Pro He Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

55 

TTC ATG CAA CAT AGC GAT CCT GTA CAA GCA GAA GAA CTA TAT ACA TTC 816 
Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

60 GGA GGA CAT GAT CCT AGT GTT ATA AGT CCT TCT ACG GAT ATG AAT ATT 864 

Gly Gly His Asp Pro Ser Val He Ser Pro Ser Thr Asp Met Asn He 
275 280 285 

TAT AAT AAA GCG TTA CAA AAT TTT CAA GAT ATA GCT AAT AGG CTT AAT 912 
65 Tyr Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala Asn Arg Leu Asn 

290 295 300 

ATT GTT TCA AGT GCC CAA GGG AGT GGA ATT GAT ATT TCC TTA TAT AAA 960
Ile Val Ser Ser Ala Gln Gly Ser Gly Ile Asp Ile Ser Leu Tyr Lys
305 310 315 320

CAA ATA TAT AAA AAT AAA TAT GAT TTT GTT GAA GAT CCT AAT GGA AAA 1008
Gln Ile Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys
325 330 335

5 TAT AGT GTA GAT AAG GAT AAG TTT GAT AAA TTA TAT AAG GCC TTA ATG 1056 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

TTT GGC TTT ACT GAA ACT AAT CTA GCT GGT GAA TAT GGA ATA AAA ACT 1104 
10 Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly lie Lys Thr 

355 360 365 

AGG TAT TCT TAT TTT AGT GAA TAT TTG CCA CCG ATA AAA ACT GAA AAA 1152
Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro Ile Lys Thr Glu Lys
370 375 380

TTG TTA GAC AAT ACA ATT TAT ACT CAA AAT GAA GGC TTT AAC ATA GCT 1200
Leu Leu Asp Asn Thr Ile Tyr Thr Gln Asn Glu Gly Phe Asn Ile Ala
385 390 395 400

AGT AAA AAT CTC AAA ACG GAA TTT AAT GGT CAG AAT AAG GCG GTA AAT 124 8 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 

405 410 415 

25 AAA GAG GCT TAT GAA GAA ATC AGC CTA GAA CAT CTC GTT ATA TAT AGA 1296 

Lys Glu Ala Tyr Glu Glu lie Ser Leu Glu His Leu Val lie Tyr Arg 
420 425 430 

ATA GCA ATG TGC AAG CCT GTA ATG TAC AAA AAT ACC GGT AAA TCT GAA 1344
Ile Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu
435 440 445

CAG TGT ATT ATT GTT AAT AAT GAG GAT TTA TTT TTC ATA GCT AAT AAA 1392 
Gin Cys lie He Val Asn Asn Glu Asp Leu Phe Phe He Ala Asn Lys 
35 450 455 460 

GAT AGT TTT TCA AAA GAT TTA GCT AAA GCA GAA ACT ATA GCA TAT AAT 144 0 

Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr He Ala Tyr Asn 

465 470 475 480 

40 

ACA CAA AAT AAT ACT ATA GAA AAT AAT TTT TCT ATA GAT CAG TTG ATT 1488 

Thr Gin Asn Asn Thr He Glu Asn Asn Phe Ser He Asp Gin Leu He 

485 490 495 

45 TTA GAT AAT GAT TTA AGC AGT GGC ATA GAC TTA CCA AAT GAA AAC ACA 1536 

Leu Asp Asn Asp Leu Ser Ser Gly He Asp Leu Pro Asn Glu Asn Thr 
500 505 510 

GAA CCA TTT ACA AAT TTT GAC GAC ATA GAT ATC CCT GTG TAT ATT AAA 1584 
50 Glu Pro Phe Thr Asn Phe Asp Asp He Asp He Pro Val Tyr He Lys 
515 520 525 

CAA TCT GCT TTA AAA AAA ATT TTT GTG GAT GGA GAT AGC CTT TTT GAA 16 32 

Gin Ser Ala Leu Lys Lys He Phe Val Asp Gly Asp Ser Leu Phe Glu 
55 530 535 540 

TAT TTA CAT GCT CAA ACA TTT CCT TCT AAT ATA GAA AAT CTA CAA CTA 1680
Tyr Leu His Ala Gln Thr Phe Pro Ser Asn Ile Glu Asn Leu Gln Leu
545 550 555 560

ACG AAT TCA TTA AAT GAT GCT TTA AGA AAT AAT AAT AAA GTC TAT ACT 1728
Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr
565 570 575

TTT TTT TCT ACA AAC CTT GTT GAA AAA GCT AAT ACA GTT GTA GGT GCT 1776
Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Gly Ala
580 585 590

TCA CTT TTT GTA AAC TGG GTA AAA GGA GTA ATA GAT GAT TTT ACA TCT 1824
Ser Leu Phe Val Asn Trp Val Lys Gly Val Ile Asp Asp Phe Thr Ser
595 600 605

GAA TCC ACA CAA AAA AGT ACT ATA GAT AAA GTT TCA GAT GTA TCC ATA 1872
Glu Ser Thr Gln Lys Ser Thr Ile Asp Lys Val Ser Asp Val Ser Ile
610 615 620

ATT ATT CCC TAT ATA GGA CCT GCT TTG AAT GTA GGA AAT GAA ACA GCT 1920
Ile Ile Pro Tyr Ile Gly Pro Ala Leu Asn Val Gly Asn Glu Thr Ala
625 630 635 640

AAA GAA AAT TTT AAA AAT GCT TTT GAA ATA GGT GGA GCC GCT ATC TTA 1968
Lys Glu Asn Phe Lys Asn Ala Phe Glu Ile Gly Gly Ala Ala Ile Leu
645 650 655

15 ATG GAG TTT ATT CCA GAA CTT ATT GTA CCT ATA GTT GGA TTT TTT ACA 2016 

Met Glu Phe He Pro Glu Leu He Val Pro He Val Gly Phe Phe Thr 
660 665 670 

TTA GAA TCA TAT GTA GGA AAT AAA GGG CAT ATT ATT ATG ACG ATA TCC 2064
Leu Glu Ser Tyr Val Gly Asn Lys Gly His Ile Ile Met Thr Ile Ser
675 680 685

AAT GCT TTA AAG AAA AGG GAT CAA AAA TGG ACA GAT ATG TAT GGT TTG 2112
Asn Ala Leu Lys Lys Arg Asp Gln Lys Trp Thr Asp Met Tyr Gly Leu
690 695 700

ATA GTA TCG CAG TGG CTC TCA ACG GTT AAT ACT CAA TTT TAT ACA ATA 216 0 

He Val Ser Gin Trp Leu Ser Thr Val Asn Thr Gin Phe Tyr Thr He 
705 710 715 720 

AAA GAA AGA ATG TAC AAT GCT TTA AAT AAT CAA TCA CAA GCA ATA GAA 2208 
Lys Glu Arg Met Tyr Asn Ala Leu Asn Asn Gin Ser Gin Ala He Glu 
725 730 735 

35 AAA ATA ATA GAA GAT CAA TAT AAT AGA TAT AGT GAA GAA GAT AAA ATG 22 56 

Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu Glu Asp Lys Met 
740 745 750 

AAT ATT AAC ATT GAT TTT AAT GAT ATA GAT TTT AAA CTT AAT CAA AGT 23 04 

40 Asn He Asn He Asp Phe Asn Asp He Asp Phe Lys Leu Asn Gin Ser 

755 760 765 

ATA AAT TTA GCA ATA AAC AAT ATA GAT GAT TTT ATA AAC CAA TGT TCT 2352 
He Asn Leu Ala He Asn Asn He Asp Asp Phe He Asn Gin Cys Ser 
45 770 775 780 

ATA TCA TAT CTA ATG AAT AGA ATG ATT CCA TTA GCT GTA AAA AAG TTA 2400 

He Ser Tyr Leu Met Asn Arg Met He Pro Leu Ala Val Lys Lys Leu 
785 790 795 800 

50 

AAA GAC TTT GAT GAT AAT CTT AAG AGA GAT TTA TTG GAG TAT ATA GAT 244 8 

Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu Leu Glu Tyr He Asp 

805 810 815 

55 ACA AAT GAA CTA TAT TTA CTT GAT GAA GTA AAT ATT CTA AAA TCA AAA 24 96 

Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn He Leu Lys Ser Lys 
820 825 830 

GTA AAT AGA CAC CTA AAA GAC AGT ATA CCA TTT GAT CTT TCA CTA TAT 2 544 

60 Val Asn Arg His Leu Lys Asp Ser He Pro Phe Asp Leu Ser Leu Tyr 

835 840 845 

ACC AAG GAC ACA ATT TTA ATA CAA GTT TTT AAT AAT TAT ATT AGT AAT 25 92 

Thr Lys Asp Thr He Leu He Gin Val Phe Asn Asn Tyr He Ser Asn 
65 850 855 860 

ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT AGA GGT GGG CGT TTA 2640
Ile Ser Ser Asn Ala Ile Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu
865 870 875 880

ATA GAT TCA TCT GGA TAT GGT GCA ACT ATG AAT GTA GGT TCA GAT GTT 2688
Ile Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val
885 890 895

5 ATC TTT AAT GAT ATA GGA AAT GGT CAA TTT AAA TTA AAT AAT TCT GAA 2 736 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn Ser Glu 
900 905 910 

AAT AGT AAT ATT ACG GCA CAT CAA AGT AAA TTC GTT GTA TAT GAT AGT 2 784 

10 Asn Ser Asn He Thr Ala His Gin Ser Lys Phe Val Val Tyr Asp Ser 

915 920 925 

ATG TTT GAT AAT TTT AGC ATT AAC TTT TGG GTA AGG ACT CCT AAA TAT 2832 
Met Phe Asp Asn Phe Ser He Asn Phe Trp Val Arg Thr Pro Lys Tyr 
15 930 935 940 

AAT AAT AAT GAT ATA CAA ACT TAT CTT CAA AAT GAG TAT ACA ATA ATT 2880 
Asn Asn Asn Asp He Gin Thr Tyr Leu Gin Asn Glu Tyr Thr He He 
945 950 955 960 

20 

AGT TGT ATA AAA AAT GAC TCA GGA TGG AAA GTA TCT ATT AAG GGA AAT 2928 
Ser Cys He Lys Asn Asp Ser Gly Trp Lys Val Ser He Lys Gly Asn 
965 970 975 

25 AGA ATA ATA TGG ACA TTA ATA GAT GTT AAT GCA AAA TCT AAA TCA ATA 2976 

Arg He He Trp Thr Leu He Asp Val Asn Ala Lys Ser Lys Ser He 
980 985 990 

TTT TTC GAA TAT AGT ATA AAA GAT AAT ATA TCA GAT TAT ATA AAT AAA 3024
Phe Phe Glu Tyr Ser Ile Lys Asp Asn Ile Ser Asp Tyr Ile Asn Lys
995 1000 1005

TGG TTT TCC ATA ACT ATT ACT AAT GAT AGA TTA GGT AAC GCA AAT ATT 3072
Trp Phe Ser Ile Thr Ile Thr Asn Asp Arg Leu Gly Asn Ala Asn Ile
1010 1015 1020

TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA ATT TTA AAC TTA GAT 3120
Tyr Ile Asn Gly Ser Leu Lys Lys Ser Glu Lys Ile Leu Asn Leu Asp
1025 1030 1035 1040

AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA TTA ATT AAT TGT ACA 3168
Arg Ile Asn Ser Ser Asn Asp Ile Asp Phe Lys Leu Ile Asn Cys Thr
1045 1050 1055

45 GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT AAT ATT TTT GGT AGA 3216 

Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Gly Arg 
1060 1065 1070 

GAA TTA AAT GCT ACA GAA GTA TCT TCA CTA TAT TGG ATT CAA TCA TCT 3 264 

50 Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tyr Trp He Gin Ser Ser 

1075 1080 1085 

ACA AAT ACT TTA AAA GAT TTT TGG GGG AAT CCT TTA AGA TAC GAT ACA 3 312 

Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro Leu Arg Tyr Asp Thr 
55 1090 1095 1100 

CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT ATC TAT ATA AAG TAT 336 0 

Gin Tyr Tyr Leu Phe Asn Gin Gly Met Gin Asn He Tyr He Lys Tyr 
1105 1110 1115 1120 






TTT AGT AAA GCT TCT ATG GGG GAA ACT GCA CCA CGT ACA AAC TTT AAT 3408
Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn Phe Asn
1125 1130 1135

AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT GGT TTA CGA TTT ATT 3456
Asn Ala Ala Ile Asn Tyr Gln Asn Leu Tyr Leu Gly Leu Arg Phe Ile
1140 1145 1150

5 ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT AAT GAT AAT ATA GTC 3 504 

He Lys Lys Ala Ser Asn Ser Arg Asn He Asn Asn Asp Asn He Val 
1155 1160 1165 

AGA GAA GGA GAT TAT ATA TAT CTT AAT ATT GAT AAT ATT TCT GAT GAA 3 552 

10 Arg Glu Gly Asp Tyr He Tyr Leu Asn He Asp Asn He Ser Asp Glu 
1170 1175 1180 

TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA GAA ATT CAA ACT CAA 3600 
Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu He Gin Thr Gin 
15 1185 1190 1195 1200 

TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG TTC TAT GAT GTA CTA 3648 
Leu Phe Leu Ala Pro He Asn Asp Asp Pro Thr Phe Tyr Asp Val Leu 
1205 1210 1215 






CAA ATA AAA AAA TAT TAT GAA AAA ACA ACA TAT AAT TGT CAG ATA CTT 36 96 

Gin He Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr Asn Cys Gin He Leu 
1220 1225 1230 



TGC GAA AAA GAT ACT AAA ACA TTT GGG CTG TTT GGA ATT GGT AAA TTT 3744
Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe Gly Ile Gly Lys Phe
1235 1240 1245

GTT AAA GAT TAT GGA TAT GTT TGG GAT ACC TAT GAT AAT TAT TTT TGC 3792 
30 Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr Asp Asn Tyr Phe Cys 

1250 1255 1260 

ATA AGT CAG TGG TAT CTC AGA AGA ATA TCT GAA AAT ATA AAT AAA TTA 384 0 

He Ser Gin Trp Tyr Leu Arg Arg He Ser Glu Asn He Asn Lys Leu 
35 1265 1270 1275 1280 

AGG TTG GGA TGT AAT TGG CAA TTC ATT CCC GTG GAT GAA GGA TGG ACA 3 888 

Arg Leu Gly Cys Asn Trp Gin Phe He Pro Val Asp Glu Gly Trp Thr 
1285 1290 1295 

GAA TAA 3894
Glu

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Met Pro Val Asn Ile Lys Asn Phe Asn Tyr Asn Asp Pro Ile Asn Asn
1 5 10 15

Asp Asp Ile Ile Met Met Glu Pro Phe Asn Asp Pro Gly Pro Gly Thr
20 25 30

Tyr Tyr Lys Ala Phe Arg Ile Ile Asp Arg Ile Trp Ile Val Pro Glu
35 40 45

Arg Phe Thr Tyr Gly Phe Gln Pro Asp Gln Phe Asn Ala Ser Thr Gly
50 55 60

Val Phe Ser Lys Asp Val Tyr Glu Tyr Tyr Asp Pro Thr Tyr Leu Lys
65 70 75 80

Thr Asp Ala Glu Lys Asp Lys Phe Leu Lys Thr Met Ile Lys Leu Phe
85 90 95

Asn Arg lie Asn Ser Lys Pro Ser Gly Gin Arg Leu Leu Asp Met He 
100 105 110 

Val Asp Ala He Pro Tyr Leu Gly Asn Ala Ser Thr Pro Pro Asp Lys 
115 120 125 

Phe Ala Ala Asn Val Ala Asn Val Ser He Asn Lys Lys He He Gin 
130 135 140 

Pro Gly Ala Glu Asp Gln Ile Lys Gly Leu Met Thr Asn Leu Ile Ile
145 150 155 160

Phe Gly Pro Gly Pro Val Leu Ser Asp Asn Phe Thr Asp Ser Met He 
165 170 175 

Met Asn Gly His Ser Pro He Ser Glu Gly Phe Gly Ala Arg Met Met 
180 185 190 

He Arg Phe Cys Pro Ser Cys Leu Asn Val Phe Asn Asn Val Gin Glu 
195 200 205 

Asn Lys Asp Thr Ser He Phe Ser Arg Arg Ala Tyr Phe Ala Asp Pro 
210 215 220 

Ala Leu Thr Leu Met His Glu Leu Ile His Val Leu His Gly Leu Tyr
225 230 235 240 

Gly He Lys He Ser Asn Leu Pro He Thr Pro Asn Thr Lys Glu Phe 
245 250 255 

Phe Met Gin His Ser Asp Pro Val Gin Ala Glu Glu Leu Tyr Thr Phe 
260 265 270 

Gly Gly His Asp Pro Ser Val He Ser Pro Ser Thr Asp Met Asn He 
275 280 285 

Tyr Asn Lys Ala Leu Gin Asn Phe Gin Asp He Ala Asn Arg Leu Asn 
290 295 300 

He Val Ser Ser Ala Gin Gly Ser Gly He Asp He Ser Leu Tyr Lys 
305 310 315 320 

Gin He Tyr Lys Asn Lys Tyr Asp Phe Val Glu Asp Pro Asn Gly Lys 
325 330 335 

Tyr Ser Val Asp Lys Asp Lys Phe Asp Lys Leu Tyr Lys Ala Leu Met 
340 345 350 

Phe Gly Phe Thr Glu Thr Asn Leu Ala Gly Glu Tyr Gly He Lys Thr 
355 360 365 

Arg Tyr Ser Tyr Phe Ser Glu Tyr Leu Pro Pro He Lys Thr Glu Lys 
370 375 380 

Leu Leu Asp Asn Thr He Tyr Thr Gin Asn Glu Gly Phe Asn He Ala 
385 390 395 400 

Ser Lys Asn Leu Lys Thr Glu Phe Asn Gly Gin Asn Lys Ala Val Asn 
405 410 415 

Lys Glu Ala Tyr Glu Glu He Ser Leu Glu His Leu Val He Tyr Arg 
420 425 430 

He Ala Met Cys Lys Pro Val Met Tyr Lys Asn Thr Gly Lys Ser Glu 
435 440 445 

Gln Cys Ile Ile Val Asn Asn Glu Asp Leu Phe Phe Ile Ala Asn Lys
450 455 460

Asp Ser Phe Ser Lys Asp Leu Ala Lys Ala Glu Thr Ile Ala Tyr Asn
465 470 475 480

Thr Gln Asn Asn Thr Ile Glu Asn Asn Phe Ser Ile Asp Gln Leu Ile
485 490 495

Leu Asp Asn Asp Leu Ser Ser Gly Ile Asp Leu Pro Asn Glu Asn Thr
500 505 510

Glu Pro Phe Thr Asn Phe Asp Asp Ile Asp Ile Pro Val Tyr Ile Lys
515 520 525

Gln Ser Ala Leu Lys Lys Ile Phe Val Asp Gly Asp Ser Leu Phe Glu
530 535 540

Tyr Leu His Ala Gln Thr Phe Pro Ser Asn Ile Glu Asn Leu Gln Leu
545 550 555 560

Thr Asn Ser Leu Asn Asp Ala Leu Arg Asn Asn Asn Lys Val Tyr Thr 
565 570 575 

Phe Phe Ser Thr Asn Leu Val Glu Lys Ala Asn Thr Val Val Gly Ala 
580 585 590 

Ser Leu Phe Val Asn Trp Val Lys Gly Val He Asp Asp Phe Thr Ser 
595 600 605 

Glu Ser Thr Gln Lys Ser Thr Ile Asp Lys Val Ser Asp Val Ser Ile
610 615 620

Ile Ile Pro Tyr Ile Gly Pro Ala Leu Asn Val Gly Asn Glu Thr Ala
625 630 635 640

Lys Glu Asn Phe Lys Asn Ala Phe Glu He Gly Gly Ala Ala He Leu 
645 650 655 

Met Glu Phe Ile Pro Glu Leu Ile Val Pro Ile Val Gly Phe Phe Thr
660 665 670

Leu Glu Ser Tyr Val Gly Asn Lys Gly His Ile Ile Met Thr Ile Ser
675 680 685

Asn Ala Leu Lys Lys Arg Asp Gln Lys Trp Thr Asp Met Tyr Gly Leu
690 695 700

Ile Val Ser Gln Trp Leu Ser Thr Val Asn Thr Gln Phe Tyr Thr Ile
705 710 715 720

Lys Glu Arg Met Tyr Asn Ala Leu Asn Asn Gin Ser Gin Ala He Glu 
725 730 735 

Lys He He Glu Asp Gin Tyr Asn Arg Tyr Ser Glu Glu Asp Lys Met 
740 745 750 

Asn He Asn He Asp Phe Asn Asp He Asp Phe Lys Leu Asn Gin Ser 
755 760 765 

He Asn Leu Ala He Asn Asn He Asp Asp Phe He Asn Gin Cys Ser 
770 775 780 

Ile Ser Tyr Leu Met Asn Arg Met Ile Pro Leu Ala Val Lys Lys Leu
785 790 795 800

Lys Asp Phe Asp Asp Asn Leu Lys Arg Asp Leu Leu Glu Tyr Ile Asp
805 810 815

Thr Asn Glu Leu Tyr Leu Leu Asp Glu Val Asn Ile Leu Lys Ser Lys
820 825 830

Val Asn Arg His Leu Lys Asp Ser Ile Pro Phe Asp Leu Ser Leu Tyr
835 840 845

Thr Lys Asp Thr Ile Leu Ile Gln Val Phe Asn Asn Tyr Ile Ser Asn
850 855 860

Ile Ser Ser Asn Ala Ile Leu Ser Leu Ser Tyr Arg Gly Gly Arg Leu
865 870 875 880

He Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn Val Gly Ser Asp Val 
885 890 895 

He Phe Asn Asp He Gly Asn Gly Gin Phe Lys Leu Asn Asn Ser Glu 
900 905 910 

Asn Ser Asn Ile Thr Ala His Gln Ser Lys Phe Val Val Tyr Asp Ser
915 920 925

Met Phe Asp Asn Phe Ser Ile Asn Phe Trp Val Arg Thr Pro Lys Tyr
930 935 940

Asn Asn Asn Asp Ile Gln Thr Tyr Leu Gln Asn Glu Tyr Thr Ile Ile
945 950 955 960

Ser Cys Ile Lys Asn Asp Ser Gly Trp Lys Val Ser Ile Lys Gly Asn
965 970 975

Arg Ile Ile Trp Thr Leu Ile Asp Val Asn Ala Lys Ser Lys Ser Ile
980 985 990

Phe Phe Glu Tyr Ser He Lys Asp Asn He Ser Asp Tyr He Asn Lys 
995 1000 1005 

Trp Phe Ser He Thr He Thr Asn Asp Arg Leu Gly Asn Ala Asn He 
1010 1015 1020 

Tyr He Asn Gly Ser Leu Lys Lys Ser Glu Lys He Leu Asn Leu Asp 
1025 1030 1035 1040 

Arg He Asn Ser Ser Asn Asp He Asp Phe Lys Leu He Asn Cys Thr 
1045 1050 1055 

Asp Thr Thr Lys Phe Val Trp He Lys Asp Phe Asn He Phe Gly Arg 
1060 1065 1070 

Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tyr Trp Ile Gln Ser Ser
1075 1080 1085

Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro Leu Arg Tyr Asp Thr
1090 1095 1100

Gln Tyr Tyr Leu Phe Asn Gln Gly Met Gln Asn Ile Tyr Ile Lys Tyr
1105 1110 1115 1120

Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro Arg Thr Asn Phe Asn
1125 1130 1135

Asn Ala Ala He Asn Tyr Gin Asn Leu Tyr Leu Gly Leu Arg Phe He 
1140 1145 1150 

Ile Lys Lys Ala Ser Asn Ser Arg Asn Ile Asn Asn Asp Asn Ile Val
1155 1160 1165

Arg Glu Gly Asp Tyr Ile Tyr Leu Asn Ile Asp Asn Ile Ser Asp Glu
1170 1175 1180

Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys Glu Ile Gln Thr Gln
1185 1190 1195 1200

Leu Phe Leu Ala Pro Ile Asn Asp Asp Pro Thr Phe Tyr Asp Val Leu
1205 1210 1215

Gln Ile Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr Asn Cys Gln Ile Leu
1220 1225 1230

Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe Gly He Gly Lys Phe 
1235 1240 1245 

Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr Asp Asn Tyr Phe Cys 
1250 1255 1260 

Ile Ser Gln Trp Tyr Leu Arg Arg Ile Ser Glu Asn Ile Asn Lys Leu
1265 1270 1275 1280

Arg Leu Gly Cys Asn Trp Gln Phe Ile Pro Val Asp Glu Gly Trp Thr
1285 1290 1295

Glu

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1535 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 108..1526

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

AGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA 60 

TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG ATATACC ATG GGC CAT 116 

Met Gly His 
1 

CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT 164
His His His His His His His His His Ser Ser Gly His Ile Glu Gly
5 10 15

CGT CAT ATG GCT AGC ATG GCT GAC ACA ATT TTA ATA CAA GTT TTT AAT 212
Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Gln Val Phe Asn
20 25 30 35

AAT TAT ATT AGT AAT ATT AGT AGT AAT GCT ATT TTA AGT TTA AGT TAT 260
Asn Tyr Ile Ser Asn Ile Ser Ser Asn Ala Ile Leu Ser Leu Ser Tyr
40 45 50

AGA GGT GGG CGT TTA ATA GAT TCA TCT GGA TAT GGT GCA ACT ATG AAT 308
Arg Gly Gly Arg Leu Ile Asp Ser Ser Gly Tyr Gly Ala Thr Met Asn
55 60 65

GTA GGT TCA GAT GTT ATC TTT AAT GAT ATA GGA AAT GGT CAA TTT AAA 356
Val Gly Ser Asp Val Ile Phe Asn Asp Ile Gly Asn Gly Gln Phe Lys
70 75 80

TTA AAT AAT TCT GAA AAT AGT AAT ATT ACG GCA CAT CAA AGT AAA TTC 404
Leu Asn Asn Ser Glu Asn Ser Asn Ile Thr Ala His Gln Ser Lys Phe
85 90 95

GTT GTA TAT GAT AGT ATG TTT GAT AAT TTT AGC ATT AAC TTT TGG GTA 452
Val Val Tyr Asp Ser Met Phe Asp Asn Phe Ser Ile Asn Phe Trp Val
100 105 110 115

AGG ACT CCT AAA TAT AAT AAT AAT GAT ATA CAA ACT TAT CTT CAA AAT 500
Arg Thr Pro Lys Tyr Asn Asn Asn Asp Ile Gln Thr Tyr Leu Gln Asn
120 125 130

GAG TAT ACA ATA ATT AGT TGT ATA AAA AAT GAC TCA GGA TGG AAA GTA 548
Glu Tyr Thr Ile Ile Ser Cys Ile Lys Asn Asp Ser Gly Trp Lys Val
135 140 145

TCT ATT AAG GGA AAT AGA ATA ATA TGG ACA TTA ATA GAT GTT AAT GCA 596
Ser Ile Lys Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp Val Asn Ala
150 155 160

AAA TCT AAA TCA ATA TTT TTC GAA TAT AGT ATA AAA GAT AAT ATA TCA 644
Lys Ser Lys Ser Ile Phe Phe Glu Tyr Ser Ile Lys Asp Asn Ile Ser
165 170 175

GAT TAT ATA AAT AAA TGG TTT TCC ATA ACT ATT ACT AAT GAT AGA TTA 692
Asp Tyr Ile Asn Lys Trp Phe Ser Ile Thr Ile Thr Asn Asp Arg Leu
180 185 190 195

GGT AAC GCA AAT ATT TAT ATA AAT GGA AGT TTG AAA AAA AGT GAA AAA 740
Gly Asn Ala Asn Ile Tyr Ile Asn Gly Ser Leu Lys Lys Ser Glu Lys
200 205 210

ATT TTA AAC TTA GAT AGA ATT AAT TCT AGT AAT GAT ATA GAC TTC AAA 788
Ile Leu Asn Leu Asp Arg Ile Asn Ser Ser Asn Asp Ile Asp Phe Lys
215 220 225

TTA ATT AAT TGT ACA GAT ACT ACT AAA TTT GTT TGG ATT AAG GAT TTT 836
Leu Ile Asn Cys Thr Asp Thr Thr Lys Phe Val Trp Ile Lys Asp Phe
230 235 240

AAT ATT TTT GGT AGA GAA TTA AAT GCT ACA GAA GTA TCT TCA CTA TAT 884
Asn Ile Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser Ser Leu Tyr
245 250 255

TGG ATT CAA TCA TCT ACA AAT ACT TTA AAA GAT TTT TGG GGG AAT CCT 932
Trp Ile Gln Ser Ser Thr Asn Thr Leu Lys Asp Phe Trp Gly Asn Pro
260 265 270 275

TTA AGA TAC GAT ACA CAA TAC TAT CTG TTT AAT CAA GGT ATG CAA AAT 980
Leu Arg Tyr Asp Thr Gln Tyr Tyr Leu Phe Asn Gln Gly Met Gln Asn
280 285 290

ATC TAT ATA AAG TAT TTT AGT AAA GCT TCT ATG GGG GAA ACT GCA CCA 1028
Ile Tyr Ile Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu Thr Ala Pro
295 300 305

CGT ACA AAC TTT AAT AAT GCA GCA ATA AAT TAT CAA AAT TTA TAT CTT 1076
Arg Thr Asn Phe Asn Asn Ala Ala Ile Asn Tyr Gln Asn Leu Tyr Leu
310 315 320

GGT TTA CGA TTT ATT ATA AAA AAA GCA TCA AAT TCT CGG AAT ATA AAT 1124
Gly Leu Arg Phe Ile Ile Lys Lys Ala Ser Asn Ser Arg Asn Ile Asn
325 330 335

AAT GAT AAT ATA GTC AGA GAA GGA GAT TAT ATA TAT CTT AAT ATT GAT 1172
Asn Asp Asn Ile Val Arg Glu Gly Asp Tyr Ile Tyr Leu Asn Ile Asp
340 345 350 355

AAT ATT TCT GAT GAA TCT TAC AGA GTA TAT GTT TTG GTG AAT TCT AAA 1220
Asn Ile Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val Asn Ser Lys
360 365 370

GAA ATT CAA ACT CAA TTA TTT TTA GCA CCC ATA AAT GAT GAT CCT ACG 1268
Glu Ile Gln Thr Gln Leu Phe Leu Ala Pro Ile Asn Asp Asp Pro Thr
375 380 385

TTC TAT GAT GTA CTA CAA ATA AAA AAA TAT TAT GAA AAA ACA ACA TAT 1316
Phe Tyr Asp Val Leu Gln Ile Lys Lys Tyr Tyr Glu Lys Thr Thr Tyr
390 395 400

AAT TGT CAG ATA CTT TGC GAA AAA GAT ACT AAA ACA TTT GGG CTG TTT 1364
Asn Cys Gln Ile Leu Cys Glu Lys Asp Thr Lys Thr Phe Gly Leu Phe
405 410 415

GGA ATT GGT AAA TTT GTT AAA GAT TAT GGA TAT GTT TGG GAT ACC TAT 1412
Gly Ile Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp Asp Thr Tyr
420 425 430 435

GAT AAT TAT TTT TGC ATA AGT CAG TGG TAT CTC AGA AGA ATA TCT GAA 1460
Asp Asn Tyr Phe Cys Ile Ser Gln Trp Tyr Leu Arg Arg Ile Ser Glu
440 445 450

AAT ATA AAT AAA TTA AGG TTG GGA TGT AAT TGG CAA TTC ATT CCC GTG 1508
Asn Ile Asn Lys Leu Arg Leu Gly Cys Asn Trp Gln Phe Ile Pro Val
455 460 465

GAT GAA GGA TGG ACA GAA TAACTCGAG 1535
Asp Glu Gly Trp Thr Glu
470
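
The feature table for SEQ ID NO:78 can be cross-checked against the sequence lines with a short script. The following is only an illustrative sketch and not part of the filing: it assumes the standard genetic code, the names `translate`, `CODON_TABLE` and `LEADER` are hypothetical, and the nucleotides are copied from the first coding lines above. It confirms that a CDS spanning positions 108 to 1526 holds 473 whole codons, and that the leading codons translate to Met-Gly followed by the ten-histidine tract.

```python
# Illustrative check only; not part of the filing.
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Values taken from the SEQ ID NO:78 header (LENGTH and CDS LOCATION).
CDS_START, CDS_END, TOTAL_LENGTH = 108, 1526, 1535
cds_nt = CDS_END - CDS_START + 1          # nucleotides inside the CDS
codons, remainder = divmod(cds_nt, 3)     # whole codons and leftover bases
trailing_nt = TOTAL_LENGTH - CDS_END      # stop codon plus 3' flanking bases

# First 26 codons of the CDS, copied from the sequence lines above.
LEADER = (
    "ATG GGC CAT "
    "CAT CAT CAT CAT CAT CAT CAT CAT CAC AGC AGC GGC CAT ATC GAA GGT "
    "CGT CAT ATG GCT AGC ATG GCT"
)
leader_protein = translate(LEADER)

print(codons, remainder, trailing_nt)
print(leader_protein)
```

The codon count agrees with the 473-residue length stated for SEQ ID NO:79, and the nine trailing nucleotides correspond to the TAA stop codon followed by CTCGAG.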






(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Met Gly His His His His His His His His His His Ser Ser Gly His
1 5 10 15

Ile Glu Gly Arg His Met Ala Ser Met Ala Asp Thr Ile Leu Ile Gln
20 25 30

Val Phe Asn Asn Tyr Ile Ser Asn Ile Ser Ser Asn Ala Ile Leu Ser
35 40 45

Leu Ser Tyr Arg Gly Gly Arg Leu Ile Asp Ser Ser Gly Tyr Gly Ala
50 55 60

Thr Met Asn Val Gly Ser Asp Val Ile Phe Asn Asp Ile Gly Asn Gly
65 70 75 80

Gln Phe Lys Leu Asn Asn Ser Glu Asn Ser Asn Ile Thr Ala His Gln
85 90 95

Ser Lys Phe Val Val Tyr Asp Ser Met Phe Asp Asn Phe Ser Ile Asn
100 105 110

Phe Trp Val Arg Thr Pro Lys Tyr Asn Asn Asn Asp Ile Gln Thr Tyr
115 120 125

Leu Gln Asn Glu Tyr Thr Ile Ile Ser Cys Ile Lys Asn Asp Ser Gly
130 135 140



Trp Lys Val Ser Ile Lys Gly Asn Arg Ile Ile Trp Thr Leu Ile Asp
145 150 155 160

Val Asn Ala Lys Ser Lys Ser Ile Phe Phe Glu Tyr Ser Ile Lys Asp
165 170 175

Asn Ile Ser Asp Tyr Ile Asn Lys Trp Phe Ser Ile Thr Ile Thr Asn
180 185 190

Asp Arg Leu Gly Asn Ala Asn Ile Tyr Ile Asn Gly Ser Leu Lys Lys
195 200 205

Ser Glu Lys Ile Leu Asn Leu Asp Arg Ile Asn Ser Ser Asn Asp Ile
210 215 220



Asp Phe Lys Leu Ile Asn Cys Thr Asp Thr Thr Lys Phe Val Trp Ile
225 230 235 240

Lys Asp Phe Asn Ile Phe Gly Arg Glu Leu Asn Ala Thr Glu Val Ser
245 250 255

Ser Leu Tyr Trp Ile Gln Ser Ser Thr Asn Thr Leu Lys Asp Phe Trp
260 265 270

Gly Asn Pro Leu Arg Tyr Asp Thr Gln Tyr Tyr Leu Phe Asn Gln Gly
275 280 285

Met Gln Asn Ile Tyr Ile Lys Tyr Phe Ser Lys Ala Ser Met Gly Glu
290 295 300

Thr Ala Pro Arg Thr Asn Phe Asn Asn Ala Ala Ile Asn Tyr Gln Asn
305 310 315 320

Leu Tyr Leu Gly Leu Arg Phe Ile Ile Lys Lys Ala Ser Asn Ser Arg
325 330 335

Asn Ile Asn Asn Asp Asn Ile Val Arg Glu Gly Asp Tyr Ile Tyr Leu
340 345 350

Asn Ile Asp Asn Ile Ser Asp Glu Ser Tyr Arg Val Tyr Val Leu Val
355 360 365

Asn Ser Lys Glu Ile Gln Thr Gln Leu Phe Leu Ala Pro Ile Asn Asp
370 375 380

Asp Pro Thr Phe Tyr Asp Val Leu Gln Ile Lys Lys Tyr Tyr Glu Lys
385 390 395 400



Thr Thr Tyr Asn Cys Gln Ile Leu Cys Glu Lys Asp Thr Lys Thr Phe
405 410 415

Gly Leu Phe Gly Ile Gly Lys Phe Val Lys Asp Tyr Gly Tyr Val Trp
420 425 430

Asp Thr Tyr Asp Asn Tyr Phe Cys Ile Ser Gln Trp Tyr Leu Arg Arg
435 440 445

Ile Ser Glu Asn Ile Asn Lys Leu Arg Leu Gly Cys Asn Trp Gln Phe
450 455 460

Ile Pro Val Asp Glu Gly Trp Thr Glu
465 470

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

CGCCATGGCT GACACAATTT TAATACAAGT 30

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "DNA"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

GCCTCGAGTT ATTCTGTCCA TCCTTCATCC AC 32
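
The relationship between the two oligonucleotides of SEQ ID NOS:80 and 81 and the insert of SEQ ID NO:78 can be checked by string comparison. The sketch below is illustrative only and not part of the filing: the helper `reverse_complement` and the variable names are hypothetical, and the two template fragments are copied from the SEQ ID NO:78 sequence lines. It assumes the oligonucleotides act as a forward and a reverse primer on that template; CCATGG and CTCGAG are the NcoI and XhoI recognition sequences.

```python
# Illustrative check only; not part of the filing.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


SEQ80 = "CGCCATGGCTGACACAATTTTAATACAAGT"    # SEQ ID NO:80, 30 nt
SEQ81 = "GCCTCGAGTTATTCTGTCCATCCTTCATCCAC"  # SEQ ID NO:81, 32 nt

# Fragments copied from SEQ ID NO:78: the region spanning the junction with
# the vector-encoded leader, and the final 30 nucleotides (1506 to 1535).
SEQ78_5PRIME = "CGTCATATGGCTAGCATGGCTGACACAATTTTAATACAAGTTTTTAAT"
SEQ78_3PRIME = "GTGGATGAAGGATGGACAGAATAACTCGAG"

# The forward oligonucleotide, minus its four 5' bases, lies on the top strand.
forward_matches = SEQ80[4:] in SEQ78_5PRIME
has_ncoi_site = "CCATGG" in SEQ80

# The reverse oligonucleotide is complementary to the 3' end of the insert.
rc81 = reverse_complement(SEQ81)
reverse_matches = rc81.startswith(SEQ78_3PRIME)
has_xhoi_site = "CTCGAG" in SEQ81

print(forward_matches, has_ncoi_site, reverse_matches, has_xhoi_site)
print(rc81)
```

Read this way, the reverse complement of SEQ ID NO:81 covers the last six codons of the coding region, the TAA stop codon and the CTCGAG site that ends SEQ ID NO:78.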

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 12

(D) OTHER INFORMATION: /note= "The asparagine residue at
this position contains an amide group."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Cys Gln Thr Ile Asp Gly Lys Lys Tyr Tyr Phe Asn
1 5 10

CLAIMS

1. A host cell containing a recombinant expression vector, said vector encoding a
protein comprising at least a portion of a Clostridium botulinum toxin, said toxin selected
from the group consisting of type B toxin and type E toxin.

2. The host cell of Claim 1, wherein said host cell is capable of expressing
said protein at a level greater than or equal to 5% of the total cellular protein.

3. The host cell of Claim 1, wherein said host cell is capable of expressing
said protein as a soluble protein at a level greater than or equal to 0.25% of the total soluble
cellular protein.

4. The host cell of Claim 1, wherein said host cell is an Escherichia coli cell.

5. The host cell of Claim 1, wherein said host cell is an insect cell.

6. The host cell of Claim 1, wherein said host cell is a yeast cell.

7. A host cell containing a recombinant expression vector, said vector encoding a 
fusion protein comprising a non-toxin protein sequence and at least a portion of a Clostridium 
botulinum toxin, said toxin selected from the group consisting of type B toxin and type E 
toxin. 

8. The host cell of Claim 7, wherein said portion of said toxin comprises the 
receptor binding domain. 

9. The host cell of Claim 7, wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

10. A vaccine comprising a fusion protein, said fusion protein comprising a non-toxin 
protein sequence and at least a portion of a Clostridium botulinum toxin, said toxin 
selected from the group consisting of type B toxin and type E toxin. 

11. The vaccine of Claim 10 further comprising a fusion protein comprising a non-toxin 
protein sequence and at least a portion of Clostridium botulinum type A toxin. 

12. The vaccine of Claim 10, wherein said portion of said Clostridium botulinum 
toxin comprises the receptor binding domain. 

13. The vaccine of Claim 10 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 

14. The vaccine of Claim 10, wherein said vaccine is substantially endotoxin-free. 

15. A method of generating antibody directed against a Clostridium botulinum 
toxin comprising: 

a) providing in any order: 

i) an antigen comprising a fusion protein comprising a non-toxin 
protein sequence and at least a portion of a Clostridium botulinum toxin, said 
toxin selected from the group consisting of type B toxin and type E toxin, and 
ii) a host; and 

b) immunizing said host with said antigen so as to generate an antibody. 

16. The method of Claim 15, wherein said antigen further comprises a fusion 
protein comprising a non-toxin protein sequence and at least a portion of Clostridium 
botulinum type A toxin. 

17. The method of Claim 15, wherein said portion of said Clostridium botulinum 
toxin comprises the receptor binding domain. 

18. The method of Claim 15 wherein said non-toxin protein sequence comprises a 
poly-histidine tract. 



19. The method of Claim 15 wherein said host is a mammal. 



20. The method of Claim 19 wherein said mammal is a human. 

21. The method of Claim 15 further comprising step c) collecting said antibodies 
from said host. 

22. The method of Claim 21 further comprising step d) purifying said antibodies. 

23. The antibody raised according to the method of Claim 15. 

24. The antibody raised according to the method of Claim 16. 

FIGURE 1 
FIGURE 2 
FIGURE 3 
FIGURE 4 
FIGURE 5 
FIGURE 6 




[Diagram labels: Fragment 1, Fragment 2, Fragment 3; restriction sites EcoRI and PstI; primers P3 and P4] 

P1-P4 are PCR primers 1-4. P1 = 5'GGAAATTTAGCTGCAGCATCTGAC3'. 
P2 = 5'TCTAGCAAATTCGCTTGTGTTGAA3'. P3 = 5'CTC... 
P4 = 5'CTATCTAGGCCTAAAGTAT3'. Indicated restriction sites in fragments 1 and 2 are internal 
sites used to clone into pGEX2T vector (fragment 1; construct called pGA30-660) or pMALc 
vector (fragment 2; construct called pMA660-1100). Bracketed restriction sites at ends of 
fragment 3 are pUC9 polylinker sites utilized to clone fragment 3 into pET23 vector 
(construct called pPA1100-2680). Numbers in these constructs refer to toxin A amino acid 
interval that is expressed. The shaded portion of the toxin A gene corresponds to the repeating 
ligand binding domain. 

FIGURE 8 

pPA660-2366 
pPA1100-2366 
pPA30-2366 
pPA30-2190 
pMA30-1870 
pMA660-1870 
Toxin A 

[Restriction sites marked on the toxin A map include HincII, SpeI, EcoRI and PstI] 

pP refers to pET23 vector, pM refers to pMALc vector, A refers to toxin A, and numbers refer to 
amino acid interval expressed in clone. Endpoints of clones correspond to indicated restriction sites 
shown on toxin A map. 

FIGURE 9 

[Toxin A restriction map; marked sites include XbaI, HincII, PstI, EcoRI, StuI and SpeI] 

pMA30-270 
pMA30-300 
pMA1100-1610 
pMA300-660 
pMA1610-1870 
pMA660-1100 
pMA1870-2680 
pMA1450-1870 
pPA1100-1450 
pPA1100-1870 
pPA1870-2680 

pP refers to pET23 vector, pM refers to pMALc vector, A refers to toxin A, and numbers refer 
to amino acid interval expressed in clone. Endpoints of clones correspond to indicated 
restriction sites shown on toxin A map. 

FIGURE 12 
FIGURE 13 

[Plot: x-axis, Dilution Factor; legend entries include two sample series, Positive Control, and Tox A Ref Plate 2] 



WO 98/08540 PCT/US97/15394 

FIGURE 14 

FIGURE 17 

Summary of expression constructs from the toxin B gene 

pPB10-1750 
pPB10-1530 

FIGURE 19 
FIGURE 20 

Expression constructs from the interval 3 region 

Binding of neutralizing CTB antibodies by recombinant toxin B protein 